(57) Abstract

The present invention relates to the isolation, identification and characterization of nucleotide sequences encoding an enzyme catalysing 
the transfer of fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway for the production of triacylglycerol, to the said 
enzymes and a process for the production of triacylglycerols.

A NEW CLASS OF ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR THE
PRODUCTION OF TRIACYLGLYCEROL AND RECOMBINANT DNA 
MOLECULES ENCODING THESE ENZYMES 

The present invention relates to the isolation, identification and characterization
of recombinant DNA molecules encoding enzymes catalysing the transfer of
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway for
the production of triacylglycerol.

Triacylglycerol (TAG) is the most common lipid-based energy reserve in nature.
The main pathway for synthesis of TAG is believed to involve three sequential
acyl-transfers from acyl-CoA to a glycerol backbone (1, 2). For many years,
acyl-CoA : diacylglycerol acyltransferase (DAGAT), which catalyzes the third
acyl transfer reaction, was thought to be the only unique enzyme involved in
TAG synthesis. It acts by diverting diacylglycerol (DAG) from membrane lipid
synthesis into TAG (2). Genes encoding this enzyme were recently identified
both in the mouse (3) and in plants (4, 5), and the encoded proteins were
shown to be homologous to acyl-CoA : cholesterol acyltransferase (ACAT). It
was also recently reported that another DAGAT exists in the oleaginous fungus
Mortierella ramanniana, which is unrelated to the mouse DAGAT, the ACAT
gene family or to any other known gene (6).

The instant invention relates to a novel type of enzymes and their encoding
genes for transformation. More specifically, the invention relates to the use of a
type of genes encoding a not previously described type of enzymes, hereinafter
designated phospholipid:diacylglycerol acyltransferases (PDAT), whereby this
enzyme catalyses an acyl-CoA-independent reaction. The said type of genes
expressed alone in transgenic organisms will enhance the total amount of oil
(triacylglycerols) produced in the cells. The PDAT genes, in combination with a
gene for the synthesis of an uncommon fatty acid will, when expressed in
transgenic organisms, enhance the levels of the uncommon fatty acids in the
triacylglycerols.

There is considerable interest world-wide in producing chemical feedstocks,
such as fatty acids, for industrial use from renewable plant resources rather
than non-renewable petrochemicals. This concept has broad appeal to
manufacturers and consumers on the basis of resource conservation and
provides significant opportunity to develop new industrial crops for agriculture.

There is a diverse array of unusual fatty acids in oils from wild plant species
and these have been well characterised. Many of these acids have industrial
potential and this has led to interest in domesticating relevant plant species to
enable agricultural production of particular fatty acids.

Development in genetic engineering technologies combined with greater
understanding of the biosynthesis of unusual fatty acids now makes it possible
to transfer genes coding for key enzymes involved in the synthesis of a
particular fatty acid from a wild species into domesticated oilseed crops. In this
way individual fatty acids can be produced in high purity and quantities at
moderate costs.

In all crops like rape, sunflower, oilpalm etc., the oil (i.e. triacylglycerols) is the
most valuable product of the seeds or fruits, and other compounds like starch,
protein, and fibre are regarded as by-products of lesser value. Enhancing the
quantity of oil per weight basis at the expense of other compounds in oil crops
would therefore increase the value of the crop. If genes regulating the allocation of
reduced carbon into the production of oil can be up-regulated, the cells will
accumulate more oil at the expense of other products. Such genes might not
only be used in already high oil producing cells, such as oil crops, but could
also induce significant oil production in moderate or low oil containing crops
such as e.g. soy, oat, maize, potato, sugar beets, and turnips as well as in
micro-organisms.

Summary of the invention

Many of the unusual fatty acids of interest, e.g. medium chain fatty acids,
hydroxy fatty acids, epoxy fatty acids and acetylenic fatty acids, have physical
properties that are distinctly different from the common plant fatty acids. The
present inventors have found that, in plant species naturally accumulating
these uncommon fatty acids in their seed oil (i.e. triacylglycerol), these acids
are absent, or present in very low amounts, in the membrane (phospho)lipids of
the seed. The low concentration of these acids in the membrane lipids is most
likely a prerequisite for proper membrane function and thereby for proper cell
functions. One aspect of the invention is that seeds of transgenic crops can be
made to accumulate high amounts of uncommon fatty acids if these fatty acids
are efficiently removed from the membrane lipids and channelled into seed
triacylglycerols.



The inventors have identified a novel class of enzymes in plants catalysing the
transfer of fatty acids from phospholipids to diacylglycerol in the production of
triacylglycerol through an acyl-CoA-independent reaction, and have found that these
enzymes (phospholipid:diacylglycerol acyltransferases, abbreviated as PDAT)
are involved in the removal of hydroxylated and epoxygenated fatty acids, and
probably also other uncommon fatty acids such as medium chain fatty acids,
from phospholipids in plants.

This enzyme reaction was shown to be present in microsomal preparations
from baker's yeast (Saccharomyces cerevisiae). The instant invention further
pertains to an enzyme comprising an amino acid sequence as set forth in SEQ
ID NO. 2 or a functional fragment, derivate, allele, homolog or isoenzyme
thereof. A so called 'knock out' yeast mutant, disrupted in the respective gene,
was obtained and microsomal membranes from the mutant were shown to
totally lack PDAT activity. Thus, it was proved that the disrupted gene encodes
a PDAT enzyme (SEQ ID NO. 1 and 2). Furthermore, this PDAT enzyme is
characterized through the amino acid sequence as set forth in SEQ ID NO. 2
containing a lipase motif of the conserved sequence string FXKWVEA.
The instant invention pertains further to an enzyme comprising an amino acid
sequence as set forth in SEQ ID NO. 1a, 2b or 5a or a functional fragment,
derivate, allele, homolog or isoenzyme thereof.
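
A candidate protein sequence can be screened for the lipase motif FXKWVEA with a simple pattern match. The following is a minimal sketch, assuming that X stands for any single amino acid in one-letter code; the function name and the toy sequence are hypothetical and are not taken from any SEQ ID.

```python
import re

# FXKWVEA, with X read as "any one amino acid" (an assumption of this sketch).
LIPASE_MOTIF = re.compile(r"F[A-Z]KWVEA")


def find_lipase_motif(protein: str) -> list[int]:
    """Return the 1-based start position of every motif match in the sequence."""
    sequence = protein.upper()
    return [match.start() + 1 for match in LIPASE_MOTIF.finditer(sequence)]


if __name__ == "__main__":
    # Hypothetical toy peptide used only to exercise the function.
    toy_peptide = "MSTLFGKWVEAQQ"
    print(find_lipase_motif(toy_peptide))
```

A match only indicates that the motif string is present; it says nothing about enzymatic activity, which the text establishes by assay.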

Further genes and/or proteins of so far unknown function were identified and
are contemplated within the scope of the instant invention. A gene from
Schizosaccharomyces pombe, SPBC776.14 (SEQ ID NO. 3), and a putative open
reading frame CAA22887 of SPBC776.14 (SEQ ID NO. 13) were identified.
Further, Arabidopsis thaliana genomic sequences (SEQ ID NO. 4, 10 and 11)
coding for putative proteins were identified, as well as a putative open reading
frame AAC80628 from the A. thaliana locus AC004557 (SEQ ID NO. 14) and a
putative open reading frame AAD10668 from the A. thaliana locus AC003027
(SEQ ID NO. 15).
Also, a partially sequenced cDNA clone from Neurospora crassa (SEQ ID NO.
9) and a Zea mays EST (Expressed Sequence Tag) clone (SEQ ID NO. 7) and
corresponding putative amino acid sequence (SEQ ID NO. 8) were identified.
Finally, two cDNA clones were identified: an Arabidopsis thaliana EST (SEQ
ID NO. 5 and corresponding predicted amino acid sequence SEQ ID NO. 6)
and a Lycopersicon esculentum EST clone (SEQ ID NO. 12).
Further, enzymes designated as PDAT comprising an amino acid sequence
selected from the group consisting of sequences as set forth in SEQ ID NO. 2a,
3a, 5b, 6 or 7b containing a lipase motif FXKWVEA are contemplated within the
scope of the invention. Moreover, an enzyme comprising an amino acid
sequence encoded through a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of sequences as set forth
in SEQ ID NO. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 10, 10b, 11, 11b or
12 or a functional fragment, derivate, allele, homolog or isoenzyme of the
enzyme encoding amino acid sequence are included within the scope of the
invention.

A functional fragment of the instant enzyme is understood to be any
polypeptide sequence which shows specific enzyme activity of a
phospholipid:diacylglycerol acyltransferase (PDAT). The length of the functional
fragment can for example vary in a range from about 660 ± 10 amino acids to
660 ± 250 amino acids, preferably from about 660 ± 50 to 660 ± 100 amino
acids, whereby the „basic number" of 660 amino acids corresponds in this case
to the polypeptide chain of the PDAT enzyme of SEQ ID NO. 2 encoded by a
nucleotide sequence according to SEQ ID NO. 1. Consequently, the „basic
number" of a functional full-length enzyme can vary in correspondence to the
encoding nucleotide sequence.

A portion of the instant nucleotide sequence is meant to be any nucleotide
sequence encoding a polypeptide which shows specific activity of a
phospholipid:diacylglycerol acyltransferase (PDAT). The length of the
nucleotide portion can vary in a wide range of about several hundreds of
nucleotides based upon the coding region of the gene or a highly conserved
sequence. For example the length varies in a range from about 1900 ± 10 to
1900 ± 1000 nucleotides, preferably from about 1900 ± 50 to 1900 ± 700 and
more preferably from about 1900 ± 100 to 1900 ± 500 nucleotides, whereby the
„basic number" of 1900 nucleotides corresponds in this case to the encoding
nucleotide sequence of the PDAT enzyme of SEQ ID NO. 1. Consequently, the
„basic number" of a functional full-length gene can vary.

An allelic variant of the instant nucleotide sequence is understood to be any
different nucleotide sequence which encodes a polypeptide with an
equivalent function. The alleles pertain to naturally occurring variants of the instant
nucleotide sequences as well as synthetic nucleotide sequences produced by
methods known in the art. Contemplated are even altered nucleotide
sequences which result in an enzyme with altered activity and/or regulation or
which is resistant against specific inhibitors. The instant invention further
includes natural or synthetic mutations of the originally isolated nucleotide
sequences. These mutations can be substitution, addition, deletion, inversion
or insertion of one or more nucleotides.

A homologous nucleotide sequence is understood to be a complementary
sequence and/or a sequence which specifically hybridizes with the instant
nucleotide sequence. Hybridizing sequences include similar sequences
selected from the group of DNA or RNA which specifically interact with the instant
nucleotide sequences under at least moderate stringency conditions which are
known in the art. A preferred, non-limiting example of stringent hybridization
conditions is hybridization in 6× sodium chloride/sodium citrate (SSC) at
about 45°C, followed by one or more washes in 0.2× SSC, 0.1% SDS at
50-65°C. This further includes short nucleotide sequences of e.g. 10 to 30
nucleotides, preferably 12 to 15 nucleotides. Included are also primers or
hybridization probes.



A homologous nucleotide sequence included within the scope of the instant
invention is a sequence which is at least about 40%, preferably at least about
50% or 60%, and more preferably at least about 70%, 80% or 90% and most
preferably at least about 95%, 96%, 97%, 98% or 99% or more homologous to
a nucleotide sequence of SEQ ID NO. 1.

All of the aforementioned definitions are true for amino acid sequences and
functional enzymes and can easily be transferred by a person skilled in the art.

Isoenzymes are understood to be enzymes which have the same or a similar
substrate specificity and/or catalytic activity but a different primary structure.

In a first embodiment, this invention is directed to nucleic acid sequences that
encode a PDAT. This includes sequences that encode biologically active
PDATs as well as sequences that are to be used as probes, vectors for
transformation or cloning intermediates. The PDAT encoding sequence may
encode a complete or partial sequence depending upon the intended use. All or
a portion of the genomic sequence, cDNA sequence, precursor PDAT or
mature PDAT is intended.

Further included is a nucleotide sequence selected from the group consisting of
sequences set forth in SEQ ID NO. 1, 1b, 3, 3b, 4, 4a, 4b, 9b, 10, 10b or 11 or a
portion, derivate, allele or homolog thereof. The invention pertains to a partial
nucleotide sequence corresponding to a full-length nucleotide sequence selected
from the group consisting of sequences set forth in SEQ ID NO. 5, 5b, 6b, 7, 8b,
9, 11b or 12 or a portion, derivate, allele or homolog thereof. Moreover, a
nucleotide sequence comprising a nucleotide sequence which is at least 40%
homologous to a nucleotide sequence selected from the group consisting of
those sequences set forth in SEQ ID NO. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b,
9, 9b, 10, 10b, 11, 11b or 12 is contemplated within the scope of the invention.



The instant invention pertains to a gene construct comprising a said nucleotide
sequence of the instant invention which is operably linked to a heterologous
nucleic acid.

The term operably linked means a serial organisation e.g. of a promoter, coding
sequence, terminator and/or further regulatory elements whereby each element
can fulfill its original function during expression of the nucleotide sequence.

Further, a vector comprising a said nucleotide sequence of the instant
invention is contemplated in the instant invention. This includes also an
expression vector as well as a vector further comprising a selectable marker
gene and/or nucleotide sequences for the replication in a host cell and/or the
integration into the genome of the host cell.

In a different aspect, this invention relates to a method for producing a PDAT in
a host cell or progeny thereof, including genetically engineered oil seeds, yeast
and moulds or any other oil accumulating organism, via the expression of a
construct in the cell. Cells containing a PDAT as a result of the production of
the PDAT encoding sequence are also contemplated within the scope of the
invention.

Further, the invention pertains to a transgenic cell or organism containing a said
nucleotide sequence and/or a said gene construct and/or a said vector. The
object of the instant invention is further a transgenic cell or organism which is
a eucaryotic cell or organism. Preferably, the transgenic cell or organism is a
yeast cell or a plant cell or a plant. The instant invention further pertains to said
transgenic cell or organism having an altered biosynthetic pathway for the
production of triacylglycerol. A transgenic cell or organism having an altered oil
content is also contemplated within the scope of this invention.

Further, the invention pertains to a transgenic cell or organism wherein the activity
of PDAT is altered in said cell or organism. This altered activity of PDAT is
characterized by an alteration in gene expression, catalytic activity and/or
regulation of activity of the enzyme. Moreover, a transgenic cell or organism is
included in the instant invention, wherein the altered biosynthetic pathway for
the production of triacylglycerol is characterized by the prevention of
accumulation of undesirable fatty acids in the membrane lipids.

In a different embodiment, this invention also relates to methods of using a
DNA sequence encoding a PDAT for increasing the oil-content within a cell.

Another aspect of the invention relates to the accommodation of high amounts
of uncommon fatty acids in the triacylglycerol produced within a cell, by
introducing a DNA sequence producing a PDAT that specifically removes these
fatty acids from the membrane lipids of the cell and channels them into
triacylglycerol. Plant cells having such a modification are also contemplated
herein.

Further, the invention pertains to a process for the production of triacylglycerol,
comprising growing a said transgenic cell or organism under conditions
whereby the said nucleotide sequence is expressed and whereby the said
transgenic cells comprise a said enzyme catalysing the transfer of fatty acids
from phospholipids to diacylglycerol forming triacylglycerol.

Moreover, triacylglycerols produced by the aforementioned process are
included in the scope of the instant invention.

An object of the instant invention is further the use of an instant nucleotide
sequence and/or a said enzyme for the production of triacylglycerol and/or
triacylglycerols with uncommon fatty acids. The use of a said instant nucleotide
sequence and/or a said enzyme of the instant invention for the transformation
of any cell or organism in order to be expressed in this cell or organism and
result in an altered, preferably increased oil content of this cell or organism is
also contemplated within the scope of the instant invention.

A PDAT of this invention includes any sequence of amino acids, such as a
protein, polypeptide or peptide fragment obtainable from a microorganism,
animal or plant source that demonstrates the ability to catalyse the production
of triacylglycerol from a phospholipid and diacylglycerol under enzyme reactive
conditions. By „enzyme reactive conditions" is meant that any necessary
conditions are available in an environment (e.g., such factors as temperature,
pH, lack of inhibiting substances) which will permit the enzyme to function.



Other PDATs are obtainable from the specific sequences provided herein.
Furthermore, it will be apparent that one can obtain natural and synthetic
PDATs, including modified amino acid sequences and starting materials for
synthetic-protein modelling from the exemplified PDATs and from PDATs which
are obtained through the use of such exemplified sequences. Modified amino
acid sequences include sequences that have been mutated, truncated,
increased and the like, whether such sequences were partially or wholly
synthesised. Sequences that are actually purified from plant preparations or
are identical or encode identical proteins thereto, regardless of the method
used to obtain the protein or sequence, are equally considered naturally



Further, the nucleic acid probes (DNA and RNA) of the present invention can
be used to screen and recover „homologous" or „related" PDATs from a variety
of plant and microbial sources.



Further, it is also apparent that a person skilled in the art can, with the
information provided in this application, in any organism identify a PDAT
activity, purify an enzyme with this activity and thereby identify a
„non-homologous" nucleic acid sequence encoding such an enzyme.



The present invention can be essentially characterized by the following 
aspects: 

1. Use of a PDAT gene (genomic clone or cDNA) for transformation.

2. Use of a DNA molecule according to item 1 wherein said DNA is used for
transformation of any organism in order to be expressed in this organism
and result in an active recombinant PDAT enzyme in order to increase oil
content of the organism.

3. Use of a DNA molecule of item 1 wherein said DNA is used for
transformation of any organism in order to prevent the accumulation of
undesirable fatty acids in the membrane lipids.

4. Use according to item 1, wherein said PDAT gene is used for transforming
transgenic oil accumulating organisms engineered to produce any
uncommon fatty acid which is harmful if present in high amounts in
membrane lipids, such as medium chain fatty acids, hydroxylated fatty
acids, epoxygenated fatty acids and acetylenic fatty acids.

5. Use according to item 1, wherein said PDAT gene is used for transforming
organisms, and wherein said organisms are crossed with other oil
accumulating organisms engineered to produce any uncommon fatty acid
which is harmful if present in high amounts in membrane lipids, comprising
medium chain fatty acids, hydroxylated fatty acids, epoxygenated fatty
acids and acetylenic fatty acids.

6. Use according to item 1, wherein said PDAT gene or cDNA codes for a
PDAT with distinct acyl specificity.

7. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 30% or more identical to the amino acid
sequence of PDAT as presented in SEQ. ID. NO. 2.

8. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 40% or more identical to the amino acid
sequence of PDAT as presented in SEQ. ID. NO. 2.

9. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 60% or more identical to the amino acid
sequence of PDAT as presented in SEQ. ID. NO. 2.

10. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 80% or more identical to the amino acid
sequence of PDAT as presented in SEQ. ID. NO. 2.

11. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from plants or contains nucleotide sequences coding for an amino
acid sequence 40% or more identical to the amino acid sequence of PDAT
from Arabidopsis thaliana or to the protein encoded by the full-length
counterpart of the partial Zea mays, Lycopersicon esculentum, or
Neurospora crassa cDNA clones.

12. Transgenic oil accumulating organisms comprising, in their genome, a
PDAT gene transferred by recombinant DNA technology or somatic
hybridization.

13. Transgenic oil accumulating organisms according to item 12 comprising, in
their genome, a PDAT gene having specificity for substrates with a
particular uncommon fatty acid and the gene for said uncommon fatty acid.

14. Transgenic organisms according to item 12 or 13 which are selected from
the group consisting of fungi, plants and animals.

15. Transgenic organisms according to item 12 or 13 which are selected from
the group of agricultural plants.

16. Transgenic organisms according to item 12 or 13 which are selected from
the group of agricultural plants and where said PDAT gene is expressed
under the control of a storage organ specific promoter.

17. Transgenic organisms according to item 12 or 13 which are selected from
the group of agricultural plants and where said PDAT gene is expressed
under the control of a seed promoter.

18. Oils from organisms according to items 12-17.

19. A method for altering acyl specificity of a PDAT by alteration of the
nucleotide sequence of a naturally occurring encoding gene and as a
consequence of this alteration creating a gene encoding an enzyme
with novel acyl specificity.

20. A protein encoded by a DNA molecule according to item 1 or a functional
fragment thereof.

21. A protein of item 20 designated phospholipid:diacylglycerol acyltransferase.

22. A protein of item 21 which has a distinct acyl specificity.

23. A protein of item 13 having the amino acid sequence as set forth in SEQ.
ID. NO. 2, 13, 14 or 15 (and the proteins encoded by the full-length or partial
genes set forth in SEQ. ID. NO. 1, 3, 4, 5, 7, 9, 10, 11 or 12) or an amino
acid sequence with at least 30% homology to said amino acid sequence.

24. A protein of item 23 isolated from Saccharomyces cerevisiae.

General methods:

Yeast strains and plasmids. The wild type yeast strains used were either
FY1679 (MATa his3-Δ200 leu2-Δ1 trp1-Δ6 ura3-52) or W303-1A (MATa ade2-1
can1-100 his3-11,15 leu2-3,112 trp1-1 ura3-1) (7). The YNR008w::KanMX2
disruption strain FVKT004-04C(AL), which is congenic to FY1679, was
obtained from the Euroscarf collection (8). A 2751 bp fragment containing the
YNR008w gene with 583 bp of 5' and 183 bp of 3' flanking DNA was amplified
from W303-1A genomic DNA using Taq polymerase with
5'-TCTCCATCTTCTGCAAAACCT-3' and 5'-CCTGTCAAAAACCTTCTCCTC-3' as
primers. The resulting PCR product was purified by agarose gel electrophoresis
and cloned into the EcoRV site of pBluescript (pbluescript-pdat). For
complementation experiments, the cloned fragment was released from
pBluescript by HindIII-SacI digestion and then cloned between the HindIII and
SacI sites of pFL39 (9), thus generating pUS1. For overexpression of the PDAT
gene, a 2202 bp EcoRI fragment from the pBluescript plasmid, which contains
only 24 bp of 5' flanking DNA, was cloned into the BamHI site of the
GAL1-TPK2 expression vector pJN92 (12), thus generating pUS4.
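
As a quick sanity check on the two primers quoted above, their length, GC fraction and reverse complement can be computed directly from the sequences. This is a minimal sketch; the helper names are hypothetical, and nothing here models annealing temperature or specificity.

```python
# Primer sequences as quoted in the text (5' to 3').
FORWARD_PRIMER = "TCTCCATCTTCTGCAAAACCT"
REVERSE_PRIMER = "CCTGTCAAAAACCTTCTCCTC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = sequence.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(sequence: str) -> str:
    """Reverse complement of an unambiguous DNA sequence (A, C, G, T only)."""
    return sequence.upper().translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    for name, primer in (("forward", FORWARD_PRIMER), ("reverse", REVERSE_PRIMER)):
        print(name, len(primer), round(gc_fraction(primer), 3), reverse_complement(primer))
```

The reverse complement of the second primer is the sequence one would look for on the strand carrying the first primer when locating the end of the amplified fragment.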

Microsomal preparations. Microsomes from developing seeds of sunflower
(Helianthus annuus), Ricinus communis and Crepis palaestina were prepared
using the procedure of Stobart and Stymne (11). To obtain yeast microsomes,
1 g of yeast cells (fresh weight) was re-suspended in 8 ml of ice-cold buffer (20
mM Tris-Cl, pH 7.9, 10 mM MgCl2, 1 mM EDTA, 5% (v/v) glycerol, 1 mM DTT,
0.3 M ammonium sulfate) in a 12 ml glass tube. To this tube, 4 ml of glass
beads (diameter 0.45-0.5 mm) were added, and the tube was then heavily
shaken (3 x 60 s) in an MSK cell homogenizer (B. Braun Melsungen AG,
Germany). The homogenized suspension was centrifuged at 20,000 x g for 15
min at 6°C and the resulting supernatant was again centrifuged at 100,000 x g
for 2 h at 6°C. The 100,000 x g pellet was resuspended in 0.1 M potassium
phosphate (pH 7.2), and stored at -80°C. It is subsequently referred to as the
crude yeast microsomal fraction.

Lipid substrates. Radio-labeled ricinoleic (12-hydroxy-9-octadecenoic) and
vernolic (12,13-epoxy-9-octadecenoic) acids were synthesized enzymatically
from [1-14C]oleic acid and [1-14C]linoleic acid, respectively, by incubation with
microsomal preparations from seeds of Ricinus communis and Crepis
palaestina, respectively (12). The synthesis of phosphatidylcholines (PC) or
phosphatidylethanolamines (PE) with 14C-labeled acyl groups in the sn-2
position was performed using either enzymatic (13), or synthetic (14) acylation
of [14C]oleic, [14C]ricinoleic, or [14C]vernolic acid. Dioleoyl-PC that was labeled
in the sn-1 position was synthesized from sn-1-[14C]oleoyl-lyso-PC and
unlabeled oleic acid as described in (14). sn-1-oleoyl-sn-2-[14C]ricinoleoyl-DAG
was synthesized from PC by the action of phospholipase C type XI from B.
cereus (Sigma Chemical Co.) as described in (15). Monovernoloyl- and
divernoloyl-DAG were synthesized from TAG extracted from seeds of
Euphorbia lagascae, using the TAG-lipase (Rhizopus arrhizus, Sigma Chemical
Co.) as previously described (16). Monoricinoleoyl-TAG was synthesized
according to the same method using TAG extracted from castor bean.

WO 00/60095    PCT/EP00/02701

Lipid analysis. The total lipid composition of yeast was determined from cells
harvested from a 40 ml liquid culture, broken in a glass-bead shaker and
extracted into chloroform as described by Bligh and Dyer (17), and then
separated by thin layer chromatography in hexane/diethylether/acetic acid
(80:20:1) using pre-coated silica gel 60 plates (Merck). The lipid areas were
located by brief exposure to I2 vapors and identified by means of appropriate
standards. Polar lipids, sterol-esters and triacylglycerols, as well as the
remaining minor lipid classes, referred to as other lipids, were excised from the
plates. Fatty acid methyl esters were prepared by heating the dry excised
material at 85°C for 60 min in 2% (v/v) sulfuric acid in dry methanol. The
methyl esters were extracted with hexane and analyzed by GLC through a 50 m
x 0.32 mm CP-Wax58-CB fused-silica column (Chrompack), with
methylheptadecanoic acid as an internal standard. The fatty acid content of
each fraction was quantified and used to calculate the relative amount of each
lipid class. In order to determine the total lipid content, 3 ml aliquots from yeast
cultures were harvested by centrifugation and the resulting pellets were washed
with distilled water and lyophilized. The weight of the dried cells was
determined and the fatty acid content was quantified by GLC-analyses after
conversion to methyl esters as described above. The lipid content was then
calculated as nmol fatty acid (FA) per mg dry weight yeast.
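
The quantification described above can be sketched in a few lines. The example assumes that each fatty acid methyl ester gives the same detector response per nmol as the internal standard, which the text does not state; all function names and the numbers in the usage section are hypothetical and are not data from this document.

```python
def fa_nmol_from_peaks(
    peak_areas: dict[str, float], istd_area: float, istd_nmol: float
) -> dict[str, float]:
    """Convert GLC peak areas to nmol using the internal standard peak.

    Assumes equal response per nmol for every fatty acid methyl ester.
    """
    if istd_area <= 0:
        raise ValueError("internal standard peak area must be positive")
    return {fa: area / istd_area * istd_nmol for fa, area in peak_areas.items()}


def lipid_content(total_fa_nmol: float, dry_weight_mg: float) -> float:
    """Lipid content expressed as nmol fatty acid per mg dry weight."""
    if dry_weight_mg <= 0:
        raise ValueError("dry weight must be positive")
    return total_fa_nmol / dry_weight_mg


def class_distribution(class_fa_nmol: dict[str, float]) -> dict[str, float]:
    """Relative amount of each lipid class as percent of total fatty acids."""
    total = sum(class_fa_nmol.values())
    if total <= 0:
        raise ValueError("total fatty acid content must be positive")
    return {name: 100.0 * nmol / total for name, nmol in class_fa_nmol.items()}


if __name__ == "__main__":
    # Hypothetical peak areas and standard amount, for illustration only.
    fatty_acids = fa_nmol_from_peaks({"16:0": 200.0, "18:1": 400.0}, 100.0, 50.0)
    print(fatty_acids)
    print(lipid_content(sum(fatty_acids.values()), dry_weight_mg=2.0))
    print(class_distribution({"TAG": 30.0, "polar lipids": 70.0}))
```

Because the relative amounts are based on fatty acid content, a lipid class carrying more acyl groups per molecule contributes proportionally more than its molar share.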



Enzyme assays. Aliquots of crude microsomal fractions (corresponding to
10 nmol of microsomal PC) from developing plant seeds or yeast cells were
lyophilized over night. 14C-labeled substrate lipids dissolved in benzene were
then added to the dried microsomes. The benzene was evaporated under a
stream of N2, leaving the lipids in direct contact with the membranes, and 0.1
ml of 50 mM potassium phosphate (pH 7.2) was added. The suspension was
thoroughly mixed and incubated at 30°C for the time period indicated, up to 90
min. Lipids were extracted from the reaction mixture using chloroform and
separated by thin layer chromatography in hexane/diethylether/acetic acid
(35:70:1.5) using silica gel 60 plates (Merck). The radioactive lipids were
visualized and quantified on the plates by electronic autoradiography (Instant
Imager, Packard, US).

Yeast cultivation. Yeast cells were grown at 28°C on a rotatory shaker in
liquid YPD medium (1% yeast extract, 2% peptone, 2% glucose), synthetic
medium (18) containing 2% (v/v) glycerol and 2% (v/v) ethanol, or minimal
medium (19) containing 16 g/l of glycerol.

The instant invention is further characterized by the following examples which
are not limiting:

Acyl-CoA-independent synthesis of TAG by oil seed microsomes. A large
number of unusual fatty acids can be found in oil seeds (20). Many of these
fatty acids, such as ricinoleic (21) and vernolic acids (22), are synthesized
using phosphatidylcholine (PC) with oleoyl or linoleoyl groups esterified to the
sn-2 position, respectively, as the immediate precursor. However, even though
PC can be a substrate for unusual fatty acid synthesis and is the major
membrane lipid in seeds, unusual fatty acids are rarely found in the
membranes. Instead, they are mainly incorporated into the TAG. A mechanism
for efficient and selective transfer of these unusual acyl groups from PC into
TAG must therefore exist in oil seeds that accumulate such unusual fatty acids.
This transfer reaction was biochemically characterized In seeds from castor 
bean {Ricinus communis) and Crepis palaestina. plants which accumulate high 
levels of rlcindeic and vemolic acid, respectively, and sunflower {Helianthus 
annuus), a plant vvhich has only common fatty acids in its seed oil. Crude 
15 microsomal fractions from developing seeds were incubated witin PC having 
"*C-labeled oleoyi, ricinoleoyi or vemoloyi groups at the sn-2 position. After the 
Incubation, lipids were extracted and analyzed by thin layer chromatography. 
We found that the amount of radioactivity that was incorporated into the neutral 
lipid fraction increased linearly over a period of 4 hours (data not shown). The 
20 distribution of [^*C]acyl groups within the neutral lipid fraction was analyzed 
after 80 min (Rg. 1). Interestingly the amount and distribution of radioactivity 
between diffferent neutral lipids were strongly dependent botti on the plant 
species and on the type of [^'*C3acyl chain. Thus, sunflower microsomes 
incorporated most of the label into DAG, regardless of the type of [^^CJacyl 
25 group. In contrast, R. communis microsomes preferentially Incorporated 
[^*C]ricinoleoyl and [^'^Clvernoloyl groups into TAG, while [^^CJoleyl groups 
mostly were found in DAG. C. palaestina microsomes, finally, incorporated only 
[^*C]vemolyol groups into TAG, with [^"^Clriclnoleyl groups being found mostly 
as free fatty acids, and ['*C]oleyl groups in DAG. This shows that the high in 
30 vivo levels of riclnoleic acid and vemolic acid in the TAG pool of R. communis 
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and C. palaestina, respectively, can be explained by an efficient and selective 
transfer of the corresponding acyl groups from PC to TAG in these organisms. 

The in vitro synthesis of triacylglycerols in microsomal preparations of
developing castor bean is summarized in Table 1.

PDAT: a novel enzyme that catalyzes acyl-CoA-independent synthesis of
TAG. We investigated whether DAG could serve both as an acyl donor and as
an acyl acceptor in the reactions catalyzed by the oil seed microsomes.
Therefore, unlabeled divernoloyl-DAG was incubated with either
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-DAG or sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC in the
presence of R. communis microsomes. The synthesis of TAG molecules
containing both [14C]ricinoleoyl and vernoloyl groups was 5-fold higher when
[14C]ricinoleoyl-PC served as acyl donor than when [14C]ricinoleoyl-DAG did
(Fig. 1B). These data strongly suggest that PC is the immediate acyl donor and
DAG the acyl acceptor in the acyl-CoA-independent formation of TAG by oil
seed microsomes. Therefore, this reaction is catalyzed by a new enzyme which
we call phospholipid : diacylglycerol acyltransferase (PDAT).

PDAT activity in yeast microsomes. Wild type yeast cells were cultivated
under conditions where TAG synthesis is induced. Microsomal membranes
were prepared from these cells and incubated with sn-2-[14C]ricinoleoyl-PC
and DAG, and the 14C-labeled products formed were analyzed. The PC-derived
[14C]ricinoleoyl groups within the neutral lipid fraction were mainly found in free
fatty acids or TAG, and the amount of TAG synthesized was
dependent on the amount of DAG that was added to the reaction (Fig. 2). The in
vitro synthesis of TAG containing both ricinoleoyl and vernoloyl groups, a TAG
species not present in vivo, from exogenously added sn-2-[14C]ricinoleoyl-PC and
unlabelled vernoloyl-DAG (Fig. 2, lane 3) clearly demonstrates the existence of
an acyl-CoA-independent synthesis of TAG involving PC and DAG as
substrates in yeast microsomal membranes. Consequently, TAG synthesis in
yeast can be catalyzed by an enzyme similar to the PDAT found in plants.

The PDAT encoding gene in yeast
A gene in the yeast genome (YNR008w) is known, but nothing is known about
the function of YNR008w, except that the gene is not essential for growth under
normal circumstances. Microsomal membranes were prepared from the yeast
strain FVKT004-04C(AL) (8) in which this gene with unknown function had been
disrupted. PDAT activity in the microsomes was assayed using PC with
radiolabelled fatty acids at the sn-2 position. The activity was found to be
completely absent in the disruption strain (Fig. 2, lane 4). Significantly, the
activity could be partially restored by the presence of YNR008w on the single
copy plasmid pUS1 (Fig. 2, lane 5). Moreover, acyl groups of
phosphatidylethanolamine (PE) were efficiently incorporated into TAG by
microsomes from the wild type strain, whereas no incorporation occurred from
this substrate in the mutant strain (data not shown). This shows that YNR008w
encodes a yeast PDAT which catalyzes the transfer of an acyl group from the
sn-2 position of phospholipids to DAG, thus forming TAG. It should be noted
that no cholesterol esters were formed from radioactive PC even in incubations
with added ergosterols, nor was the amount of radioactive free fatty acids
formed from PC affected by disruption of the YNR008w gene (data not shown).
This demonstrates that yeast PDAT does not have cholesterol ester synthesising
or phospholipase activities.

Increased TAG content in yeast cells that overexpress PDAT. The effect
of overexpressing the PDAT-encoding gene was studied by transforming a wild
type yeast strain with the pUS4 plasmid in which the gene is expressed from
the galactose-induced GAL1-TPK2 promoter. Cells containing the empty
expression vector were used as a control. The cells were grown in synthetic
glycerol-ethanol medium, and expression of the gene was induced after either 2
hours (early log phase) or 25 hours (stationary phase) by the addition of
galactose. The cells were then incubated for another 21 hours, after which they
were harvested and assays were performed. We found that overexpression of
PDAT had no significant effect on the growth rate as determined by the optical
density. However, the total lipid content, measured as µmol fatty acids per mg
yeast dry weight, was 47% (log phase) or 29% (stationary phase) higher in the
PDAT overexpressing strain than in the control. Furthermore, the polar lipid and
sterol ester content was unaffected by overexpression of PDAT. Instead, the
elevated lipid content in these cells is entirely due to an increased TAG content
(Fig. 3A,B). Thus, the amount of TAG was increased 2-fold in PDAT
overexpressing early log phase cells and by 40% in stationary phase cells. It is
interesting to note that a significant increase in the TAG content was achieved
by overexpressing PDAT even under conditions (i.e. in stationary phase) where
DAGAT is induced and thus contributes significantly to TAG synthesis. In vitro
PDAT activity assayed in microsomes from the PDAT overexpressing strain
was 7-fold higher than in the control strain, a finding which is consistent with the
increased levels of TAG that we observed in vivo (Fig. 3C). These results
clearly demonstrate the potential use of the PDAT gene in increasing the oil
content in transgenic organisms.

Substrate specificity of yeast PDAT. The substrate specificity of yeast
PDAT was analyzed using microsomes prepared from the PDAT
overexpressing strain (see Fig. 4). The rate of TAG synthesis, under the conditions
given in Figure 4 with di-oleoyl-PC as the acyl donor, was 0.15 nmol per min
and mg protein. With both oleoyl groups of PC labeled it was possible, under
the given assay conditions, to detect the transfer of 11 pmol/min of [14C]oleoyl
chain into TAG and the formation of 15 pmol/min of lyso-PC. In microsomes
from the PDAT-deficient strain, no TAG at all and only trace amounts of
lyso-PC were detected, strongly suggesting that yeast PDAT catalyses the formation
of equimolar amounts of TAG and lyso-PC when supplied with PC and DAG as
substrates. The fact that somewhat more lyso-PC than TAG is formed can be
explained by the presence of a phospholipase in yeast microsomes, which
produces lyso-PC and unesterified fatty acids from PC.
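
The stoichiometric argument above can be illustrated with a few lines of arithmetic. The following is only a sketch: it takes the two rates quoted in the text (11 pmol/min into TAG, 15 pmol/min of lyso-PC), assumes that each product molecule carries one labeled acyl chain, and uses function names that are purely illustrative.

```python
def excess_lyso_pc(tag_rate: float, lyso_pc_rate: float) -> float:
    """Lyso-PC formed beyond the 1:1 amount expected from TAG synthesis."""
    return lyso_pc_rate - tag_rate


def lyso_pc_to_tag_ratio(tag_rate: float, lyso_pc_rate: float) -> float:
    """Molar ratio of lyso-PC to TAG; 1.0 would mean strictly equimolar."""
    if tag_rate <= 0:
        raise ValueError("TAG rate must be positive")
    return lyso_pc_rate / tag_rate


# Rates quoted in the text (pmol/min).
tag_rate_pmol_min = 11.0
lyso_pc_rate_pmol_min = 15.0

surplus = excess_lyso_pc(tag_rate_pmol_min, lyso_pc_rate_pmol_min)
ratio = lyso_pc_to_tag_ratio(tag_rate_pmol_min, lyso_pc_rate_pmol_min)
print(f"lyso-PC/TAG ratio: {ratio:.2f}")
print(f"lyso-PC beyond a 1:1 reaction: {surplus:.0f} pmol/min")
```

Under these assumptions the surplus lyso-PC is the portion that the text attributes to the phospholipase present in the microsomes.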

The specificity of yeast PDAT for different acyl group positions was
investigated by incubating the microsomes with di-oleoyl-PC carrying a
[14C]acyl group either at the sn-1 position (Fig. 4A, bar 2) or the sn-2 position
(Fig. 4A, bar 3). We found that the major 14C-labeled product formed in the
former case was lyso-PC, and in the latter case TAG. We conclude that yeast
PDAT has a specificity for the transfer of acyl groups from the sn-2 position of
the phospholipid to DAG, thus forming sn-1-lyso-PC and TAG. Under the given
assay conditions, trace amounts of 14C-labelled DAG are formed from the sn-1
labeled PC by the reversible action of a CDP-choline : choline
phosphotransferase. This labeled DAG can then be further converted into TAG
by the PDAT activity. It is therefore not possible to distinguish whether the
minor amounts of labeled TAG that are formed in the presence of di-oleoyl-PC
carrying a [14C]acyl group in the sn-1 position are synthesized directly from the
sn-1-labeled PC by a PDAT that can also act on the sn-1 position, or whether the
PC is first converted to sn-1-labeled DAG and then acylated by a PDAT with strict
selectivity for the transfer of acyl groups at the sn-2 position of PC. Taken
together, this shows that the PDAT encoded by YNR008w catalyses an acyl
transfer from the sn-2 position of PC to DAG, thus causing the formation of
TAG and lyso-PC.

The substrate specificity of yeast PDAT was further analyzed with respect
to the headgroup of the acyl donor, the acyl group transferred and the acyl
chains of the acceptor DAG molecule. The two major membrane lipids of S.
cerevisiae are PC and PE, and as shown in Fig. 4B (bars 1 and 2), dioleoyl-PE
is nearly 4-fold more efficient than dioleoyl-PC as acyl donor in the
PDAT-catalyzed reaction. Moreover, the rate of acyl transfer is strongly dependent on
the type of acyl group that is transferred. Thus, a ricinoleoyl group at the sn-2
position of PC is 2.5 times more efficiently transferred into TAG than an oleoyl
group in the same position (Fig. 4B, bars 1 and 3). In contrast, yeast PDAT has
no preference for the transfer of vernoloyl groups over oleoyl groups (Fig. 4B,
bars 1 and 4). The acyl chain of the acceptor DAG molecule also affects the
efficiency of the reaction. Thus, DAG with a ricinoleoyl or a vernoloyl group is a
more efficient acyl acceptor than dioleoyl-DAG (Fig. 4B, bars 1, 5 and 6). Taken
together, these results clearly show that the efficiency of the PDAT-catalyzed
acyl transfer is strongly dependent on the properties of the substrate lipids.

PDAT genes. Nucleotide and amino acid sequences of several PDAT
genes are given as SEQ ID NO. 1 through 15. Further provisional and/or partial
sequences are given as SEQ ID NO. 1a through 5a and 1b through 11b,
respectively. One of the Arabidopsis genomic sequences (SEQ ID NO. 4)
identified an Arabidopsis EST cDNA clone, T04806. This cDNA clone was fully
characterised and the nucleotide sequence is given as SEQ ID NO. 5. Based
on the sequence homology of the T04806 cDNA and the Arabidopsis thaliana
genomic DNA sequence (SEQ ID NO. 4) it is apparent that an additional A is
present at position 417 in the cDNA clone (data not shown). Excluding this
nucleotide would give the amino acid sequence depicted in SEQ ID NO. 12.
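
The effect of excluding a single inserted nucleotide before translation can be sketched as follows. This is a minimal illustration only: the sequence is a short made-up toy (not the T04806 clone), the function names are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def delete_base(seq: str, position: int) -> str:
    """Return seq without the base at the given 1-based position."""
    if not 1 <= position <= len(seq):
        raise ValueError("position outside the sequence")
    return seq[:position - 1] + seq[position:]


def translate(seq: str) -> str:
    """Translate in frame 1; trailing bases that do not fill a codon are ignored."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical toy cDNA carrying one extra A at position 7.
toy_cdna = "ATGGCTAGATCCGTGGTAA"
print(translate(toy_cdna))                  # frame-shifted reading
print(translate(delete_base(toy_cdna, 7)))  # reading after excluding the extra base
```

For a real clone the same two helpers would be called with the cDNA sequence and the position of the suspected extra base (417 in the text).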

Increased TAG content in seeds of Arabidopsis thaliana that express the
yeast PDAT. For the expression of the yeast PDAT gene in Arabidopsis
thaliana, an EcoRI fragment from the pBluescript-PDAT was cloned together
with the napin promoter (25) into the vector pGPTV-KAN (26). A plasmid
(pGNapPDAT) having the yeast PDAT gene in the correct orientation was
identified and transformed into Agrobacterium tumefaciens. These bacteria
were used to transform Arabidopsis thaliana Columbia (C-24) plants using the
root transformation method (27). Plants transformed with an empty vector were
used as controls.

First generation seeds (T1) were harvested and germinated on kanamycin
containing medium. Second generation seeds (T2) were pooled from individual
plants and their fatty acid contents analysed by quantification of their methyl
esters by gas liquid chromatography after methylation of the seeds with 2%
sulphuric acid in methanol at 85 °C for 1.5 hours. Quantification was done with
heptadecanoic acid methyl esters as internal standard.

From the transformation with pGNapPDAT, one T1 plant (26-14) gave rise to
seven T2 plants, of which 3 plants yielded seeds with statistically significantly
higher oil content (in a mean difference two-sided test) than seeds from T2 plants
generated from T1 plant 32-4 transformed with an empty vector (Table 2).
Description of Figures

FIG. 1.

Metabolism of 14C-labeled PC into the neutral lipid fraction by plant
microsomes. (A) Microsomes from developing seeds of sunflower, R.
communis and C. palaestina were incubated for 80 min at 30 °C with PC (8
nmol) having oleic acid in its sn-1 position, and either 14C-labeled oleic,
ricinoleic or vernolic acid in its sn-2 position. Radioactivity incorporated in TAG
(open bars), DAG (solid bars), and unesterified fatty acids (hatched bars) was
quantified using thin layer chromatography followed by electronic
autoradiography, and is shown as percentage of added labeled substrate. (B)
Synthesis in vitro of TAG carrying two vernoloyl and one [14C]ricinoleoyl group
by microsomes from R. communis. The substrates added were unlabeled
divernoloyl-DAG (5 nmol), together with either sn-1-oleoyl-sn-2-[14C]ricinoleoyl-DAG
(0.4 nmol, 7700 dpm/nmol) or sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC (0.4
nmol, 7700 dpm/nmol). The microsomes were incubated with the substrates for
30 min at 30 °C, after which samples were removed for lipid analysis as
described in the section "General methods". The data shown are the average of
two experiments.
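
The conversion behind "percentage of added labeled substrate" can be sketched as below. The specific activity (7700 dpm/nmol) and the amount added (0.4 nmol) are taken from panel B of the legend; the band count is a made-up number for illustration, and the function names are hypothetical.

```python
def dpm_to_nmol(dpm: float, specific_activity_dpm_per_nmol: float) -> float:
    """Convert measured radioactivity (dpm) into nmol of labeled acyl groups."""
    if specific_activity_dpm_per_nmol <= 0:
        raise ValueError("specific activity must be positive")
    return dpm / specific_activity_dpm_per_nmol


def percent_of_added_label(dpm_in_band: float, nmol_added: float,
                           specific_activity_dpm_per_nmol: float) -> float:
    """Express the label found in one TLC band as a percentage of the label added."""
    total_dpm_added = nmol_added * specific_activity_dpm_per_nmol
    if total_dpm_added <= 0:
        raise ValueError("added label must be positive")
    return 100.0 * dpm_in_band / total_dpm_added


# Hypothetical count for a TAG band; not a value reported in the document.
hypothetical_tag_band_dpm = 308.0
print(dpm_to_nmol(hypothetical_tag_band_dpm, 7700.0))
print(percent_of_added_label(hypothetical_tag_band_dpm, 0.4, 7700.0))
```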



FIG. 2.

PDAT activity in yeast microsomes, as visualized by autoradiogram of neutral
lipid products separated on TLC. Microsomal membranes (10 nmol of PC) from
the wild type yeast strain FY1679 (lanes 1-3), a congenic yeast strain
(FVKT004-04C(AL)) that is disrupted for YNR008w (lane 4) or the same
disruption strain transformed with the plasmid pUS1, containing the YNR008w
gene behind its native promoter (lane 5), were assayed for PDAT activity. As
substrates, we used 2 nmol sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC together with
either 5 nmol of dioleoyl-DAG (lanes 2, 4 and 5) or rac-oleoyl-vernoleoyl-DAG
(lane 3). The enzymatic assay and lipid analysis were performed as described in
Materials and Methods. The cells were precultured for 20 h in liquid YPD
medium, harvested and re-suspended in an equal volume of minimal medium
(19) containing 16 g/l glycerol. The cells were then grown for an additional 24 h
prior to being harvested. Selection for the plasmid was maintained by growing
the transformed cells in synthetic medium lacking uracil (18). Abbreviations:
1-OH-TAG, monoricinoleoyl-TAG; 1-OH-1-ep-TAG,
monoricinoleoyl-monovernoloyl-TAG; OH-FA, unesterified ricinoleic acid.

FIG. 3.

Lipid content (A, B) and PDAT activity (C) in PDAT overexpressing yeast cells.
The PDAT gene in the plasmid pUS4 was overexpressed from the
galactose-induced GAL1-TPK2 promoter in the wild type strain W303-1A (7). Its
expression was induced after (A) 2 hours or (B) 25 hours of growth by the
addition of 2% final concentration (w/v) of galactose. The cells were then
incubated for another 22 hours before being harvested. The amount of lipids of
the harvested cells was determined by GLC analysis of their fatty acid contents
and is presented as µmol fatty acids per mg dry weight in TAG (open
bar), polar lipids (hatched bar), sterol esters (solid bar) and other lipids (striped
bar). The data shown are the mean values of results with three independent
yeast cultures. (C) In vitro synthesis of TAG by microsomes prepared from
yeast cells containing either the empty vector (vector) or the PDAT plasmid
(+ PDAT). The cells were grown as in Fig. 3A. The substrate lipids dioleoyl-DAG
(2.5 nmol) and sn-1-oleoyl-sn-2-[14C]oleoyl-PC (2 nmol) were added to aliquots
of microsomes (10 nmol PC), which were then incubated for 10 min at 28 °C.
The amount of label incorporated into TAG was quantified by electronic
autoradiography. The results shown are the mean values of two experiments.

FIG. 4.

Substrate specificity of yeast PDAT. The PDAT activity was assayed by
incubating aliquots of lyophilized microsomes (10 nmol PC) with substrate lipids
at 30 °C for 10 min (panel A) or 90 min (panel B). Unlabeled DAG (2.5 nmol)
was used as substrate together with different labeled phospholipids, as shown
in the figure. (A) Sn-position specificity of yeast PDAT regarding the acyl donor
substrate. Dioleoyl-DAG was used together with either
sn-1-[14C]oleoyl-sn-2-[14C]oleoyl-PC (di-[14C]-PC),
sn-1-[14C]oleoyl-sn-2-oleoyl-PC (sn1-[14C]-PC) or
sn-1-oleoyl-sn-2-[14C]oleoyl-PC (sn2-[14C]-PC). (B) Specificity of yeast PDAT regarding
the phospholipid headgroup and the acyl composition of the phospholipid as well
as of the diacylglycerol. Dioleoyl-DAG was used together with either
sn-1-oleoyl-sn-2-[14C]oleoyl-PC (oleoyl-PC), sn-1-oleoyl-sn-2-[14C]oleoyl-PE (oleoyl-PE),
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC (ricinoleoyl-PC) or
sn-1-oleoyl-sn-2-[14C]vernoloyl-PC (vernoloyl-PC). In the experiments presented in the 2 bars to
the far right, monoricinoleoyl-DAG (ricinoleoyl-DAG) or mono-vernoloyl-DAG
(vernoloyl-DAG) was used together with sn-1-oleoyl-sn-2-[14C]oleoyl-PC. The
label that was incorporated into TAG (solid bars) and lyso-PC (LPC, open bars)
was quantified by electronic autoradiography. The results shown are the mean
values of two experiments. The microsomes used were from W303-1A cells
overexpressing the PDAT gene from the GAL1-TPK2 promoter, as described in
Fig. 3. The expression was induced at early stationary phase and the cells were
harvested after an additional 24 h.

TAB. 1:

In vitro synthesis of triacylglycerols in microsomal preparations of developing
castor bean. Aliquots of microsomes (20 nmol PC) were lyophilised and
substrate lipids were added in benzene solution: (A) 0.4 nmol [14C]-DAG (7760
dpm/nmol) and, where indicated, 1.6 nmol unlabelled DAG; (B) 0.4 nmol
[14C]-DAG (7760 dpm/nmol) and 5 nmol unlabelled di-ricinoleoyl-PC; and (C) 0.25
nmol [14C]-PC (4000 dpm/nmol) and 5 nmol unlabelled DAG. The benzene was
evaporated under N2 and 0.1 ml of 50 mM potassium phosphate was added,
thoroughly mixed and incubated at 30 °C for (A) 20 min; (B) and (C) 30 min.
Assays were terminated by extraction of the lipids in chloroform. The lipids
were then separated by thin layer chromatography on silica gel 60 plates
(Merck; Darmstadt, Germany) in hexane/diethylether/acetic acid (35:70:1.5). The
radioactive lipids were visualised and the radioactivity quantified on the plate by
electronic autoradiography (Instant Imager, Packard, US). Results are
presented as mean values of two experiments.

Radioactivity in different triacylglycerol (TAG) species formed. Abbreviations
used: 1-OH-, mono-ricinoleoyl-; 2-OH-, di-ricinoleoyl-; 3-OH-, tri-ricinoleoyl-;
1-OH-1-ver-, mono-ricinoleoyl-monovernoleoyl-; 1-OH-2-ver-,
mono-ricinoleoyl-divernoleoyl-. Radiolabelled DAG and PC were prepared enzymatically. The
radiolabelled ricinoleoyl group is attached at the sn-2 position of the lipid and
the unlabelled oleoyl group at the sn-1 position. Unlabelled DAG with vernoleoyl or
ricinoleoyl chains were prepared by the action of TAG lipase (6) on oil of
Euphorbia lagascae or castor bean, respectively. Synthetic di-ricinoleoyl-PC
was kindly provided by Metapontum Agribios (Italy).

TAB. 2:

Total fatty acids per mg of T2 seeds pooled from individual Arabidopsis thaliana
plants transformed with the yeast PDAT gene under the control of the napin promoter
(26-14) or transformed with empty vector (32-4).

* = statistical difference between control plants and PDAT transformed plants
in a mean difference two-sided test at α = 5.
Description of the SEQ ID:

SEQ ID NO. 1: Genomic DNA sequence and suggested amino acid sequence of
the Saccharomyces cerevisiae PDAT gene, YNR008w, with GenBank accession
numbers Z71623 and Y13139, and with nucleotide ID number 1302481.

SEQ ID NO. 2: The amino acid sequence of the suggested open reading frame
YNR008w from Saccharomyces cerevisiae.

SEQ ID NO. 3: Genomic DNA sequence of the Schizosaccharomyces pombe gene
SPBC776.14.

SEQ ID NO. 4: Genomic DNA sequence of part of the Arabidopsis thaliana locus
with GenBank accession number AB006704.

SEQ ID NO. 5: Nucleotide sequence of the Arabidopsis thaliana cDNA clone with
GenBank accession number T04806, and nucleotide ID number 315966.

SEQ ID NO. 6: Predicted amino acid sequence of the Arabidopsis thaliana cDNA
clone with GenBank accession number T04806.

SEQ ID NO. 7: Nucleotide and amino acid sequence of the Zea mays EST clone
with GenBank accession number AI491339, and nucleotide ID number 4388167.

SEQ ID NO. 8: Predicted amino acid sequence of the Zea mays EST clone with
GenBank accession number AI491339, and nucleotide ID number 4388167.

SEQ ID NO. 9: DNA sequence of part of the Neurospora crassa EST clone
W07G1, with GenBank accession number AI398644, and nucleotide ID number
4241729.

SEQ ID NO. 10: Genomic DNA sequence of part of the Arabidopsis thaliana locus
with GenBank accession number AC004557.

SEQ ID NO. 11: Genomic DNA sequence of part of the Arabidopsis thaliana locus
with GenBank accession number AC003027.

SEQ ID NO. 12: DNA sequence of part of the Lycopersicon esculentum cDNA
clone with GenBank accession number AI486635.

SEQ ID NO. 13: Amino acid sequence of the Schizosaccharomyces pombe
putative open reading frame CAA22887 of the Schizosaccharomyces pombe gene
SPBC776.14.

SEQ ID NO. 14: Amino acid sequence of the Arabidopsis thaliana putative open
reading frame AAC80628 derived from the Arabidopsis thaliana locus with
GenBank accession number AC004557.

SEQ ID NO. 15: Amino acid sequence of the Arabidopsis thaliana putative open
reading frame AAD10668 derived from the Arabidopsis thaliana locus with
GenBank accession number AC003027.

Further provisional and/or partial sequences are defined through the following
SEQ IDs:

SEQ ID NO. 1a: The amino acid sequence of the yeast ORF YNR008w from
Saccharomyces cerevisiae.

SEQ ID NO. 2a: Amino acid sequence of the region of the Arabidopsis thaliana
genomic sequence (AC004557).

SEQ ID NO. 3a: Amino acid sequence of the region of the Arabidopsis thaliana
genomic sequence (AB006704).

SEQ ID NO. 4a: The corresponding genomic DNA sequence and amino acid
sequence of the yeast ORF YNR008w from Saccharomyces cerevisiae.

SEQ ID NO. 5a: The amino acid sequence of the yeast ORF YNR008w from
Saccharomyces cerevisiae derived from the corresponding genomic DNA
sequence.

SEQ ID NO. 1b: Genomic DNA sequence of the Saccharomyces cerevisiae
PDAT gene, YNR008w, GenBank nucleotide ID number 1302481, and the
suggested YNR008w amino acid sequence.

SEQ ID NO. 2b: The suggested amino acid sequence of the yeast gene
YNR008w from Saccharomyces cerevisiae.

SEQ ID NO. 3b: Genomic DNA sequence of the Schizosaccharomyces pombe
gene SPBC776.14.

SEQ ID NO. 4b: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AB006704.

SEQ ID NO. 5b: Nucleotide sequence and the corresponding amino acid
sequence of the Arabidopsis thaliana EST clone with GenBank accession
number T04806, and ID number 315966.

SEQ ID NO. 6b: Nucleotide and amino acid sequence of the Zea mays cDNA
clone with GenBank ID number 4388167.

SEQ ID NO. 7b: Amino acid sequence of the Zea mays cDNA clone with
GenBank ID number 4388167.

SEQ ID NO. 8b: DNA sequence of part of the Neurospora crassa cDNA clone
W07G1, ID number 4241729.

SEQ ID NO. 9b: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AC004557.

SEQ ID NO. 10b: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AC003027.

SEQ ID NO. 11b: DNA sequence of part of the Lycopersicon esculentum cDNA
clone with GenBank accession number AI486635.


WO 00/60095                                                  PCT/EP00/02701



Claims

1. An enzyme catalysing in an acyl-CoA-independent reaction the transfer of
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway
for the production of triacylglycerol.

2. An enzyme according to claim 1, comprising an amino acid sequence as
set forth in SEQ ID No. 2 or a functional fragment, derivate, allele, homolog
or isoenzyme thereof.

3. An enzyme according to claims 1 or 2 designated as
phospholipid:diacylglycerol acyltransferase (PDAT).

4. An enzyme according to claims 1 to 3, comprising an amino acid sequence
as set forth in SEQ ID No. 1a, 2b or 5a or a functional fragment, derivate,
allele, homolog or isoenzyme thereof.

5. An enzyme according to claims 1 to 4, comprising an amino acid sequence
selected from the group consisting of sequences as set forth in SEQ ID No.
2a, 3a, 5b, 6, 7b, 8, 13, 14 or 15 or a functional fragment, derivate, allele,
homolog or isoenzyme thereof.

6. An enzyme according to claims 1 to 5, comprising an amino acid sequence
encoded through a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of sequences as set
forth in SEQ ID No. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 10, 10b,
11, 11b or 12 or a functional fragment, derivate, allele, homolog or
isoenzyme of the enzyme encoding amino acid sequence.

7. A nucleotide sequence encoding an enzyme catalysing in an acyl-CoA-independent
reaction the transfer of fatty acids from phospholipids to
diacylglycerol in the biosynthetic pathway for the production of
triacylglycerol.

8. A nucleotide sequence according to claim 7 encoding an enzyme
designated as phospholipid:diacylglycerol acyltransferase (PDAT).

9. A nucleotide sequence according to claims 7 or 8, selected from the group
consisting of sequences as set forth in SEQ ID No. 1, 1b, 3, 3b, 4, 4a, 4b,
9b, 10, 10b or 11 or a portion, derivate, allele or homolog thereof.

10. A partial nucleotide sequence corresponding to a full-length nucleotide
sequence according to claims 7 to 9, selected from the group consisting of
sequences as set forth in SEQ ID No. 5, 5b, 6b, 7, 8b, 9, 11b or 12 or a
portion, derivate, allele or homolog thereof.

11. A nucleotide sequence according to claims 7 to 10, comprising a nucleotide
sequence which is at least 40% homologous to a nucleotide sequence
selected from the group consisting of those sequences set forth in SEQ ID
No. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 10, 10b, 11, 11b or 12.

12. A gene construct comprising a nucleotide sequence according to claims 7
to 11 operably linked to a heterologous nucleic acid.

13. A vector comprising a nucleotide sequence according to claims 7 to 11 or a
gene construct according to claim 12.

14. A vector according to claim 13, which is an expression vector.

15. A vector according to claims 13 or 14, further comprising a selectable
marker gene and/or nucleotide sequences for the replication in a host cell
or the integration into the genome of the host cell.
16. A transgenic cell or organism containing a nucleotide sequence according
to claims 7 to 11 and/or a gene construct according to claim 12 and/or a
vector according to claims 13 to 15.

17. A transgenic cell or organism according to claim 16 which is a eucaryotic
cell or organism.

18. A transgenic cell or organism according to claims 16 or 17 which is a yeast
cell or a plant cell or a plant.

19. A transgenic cell or organism according to claims 16 to 18 having an
altered biosynthetic pathway for the production of triacylglycerol.

20. A transgenic cell or organism according to claims 16 to 19 having an
altered oil content.

21. A transgenic cell or organism according to claims 16 to 20 wherein the
activity of PDAT is altered.

22. A transgenic cell or organism according to claims 16 to 21 wherein the
altered activity of PDAT is characterized by an alteration in gene
expression, catalytic activity and/or regulation of activity of the enzyme.

23. A transgenic cell or organism according to claims 16 to 22 wherein the
altered biosynthetic pathway for the production of triacylglycerol is
characterized by the prevention of accumulation of undesirable fatty acids
in the membrane lipids.

24. A process for the production of triacylglycerol, comprising growing a
transgenic cell or organism according to claims 16 to 23 under conditions
whereby the said nucleotide sequence according to claims 7 to 11 is
expressed.

25. Triacylglycerols produced by a process according to claim 24.

26. Use of a nucleotide sequence according to claims 7 to 11 and/or an
enzyme according to claims 1 to 6 for the production of triacylglycerol
and/or triacylglycerols with uncommon fatty acids.

27. Use of a nucleotide sequence according to claims 7 to 11 and/or an
enzyme according to claims 1 to 6 for the transformation of any cell or
organism in order to be expressed in this cell or organism and result in an
altered, preferably increased oil content of this cell or organism.

Tab. 2:

T1 plant   T2 plant number   nmol fatty acids per mg seed   standard deviation

32-4       1                 1277                           ±11 (n=2)
32-4       4                 1261                           ±63 (n=3)
32-4       5                 1369                           ±17 (n=3)
32-4       6                 1312                           ±53 (n=4)
32-4       7                 1197                           ±54 (n=5)
32-4       8                 1240                           ±78 (n=4)
32-4       9                 1283                           ±54 (n=5)
32-4       10                1381                           ±35 (n=5)

26-14      1                 1444                           ±110 (n=4)
26-14      2                 1617*                          ±109 (n=4)
26-14      3                 1374                           ±37 (n=2)
26-14      5                 1562*                          ±70 (n=4)
26-14      6                 1393                           ±77 (n=4)
26-14      7                 1433                           ±98 (n=4)
26-14      8                 1581*                          ±82 (n=4)
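
The document does not state which "mean difference two-sided test" was applied to these values. Purely as an illustration, and as an assumption on our part, the sketch below computes Welch's t statistic and its approximate degrees of freedom from summary statistics (mean, standard deviation, n). The example call uses two rows of the table (1617 ± 109, n=4 and 1381 ± 35, n=5); treating the second as a control row, and this pairing as the relevant comparison, are likewise assumptions.

```python
import math


def welch_t(mean_a: float, sd_a: float, n_a: int,
            mean_b: float, sd_b: float, n_b: int) -> tuple[float, float]:
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom
    for two samples described only by mean, standard deviation and size."""
    if n_a < 2 or n_b < 2:
        raise ValueError("each sample needs at least two observations")
    var_a = sd_a ** 2 / n_a  # squared standard error of mean A
    var_b = sd_b ** 2 / n_b  # squared standard error of mean B
    t = (mean_a - mean_b) / math.sqrt(var_a + var_b)
    df = (var_a + var_b) ** 2 / (var_a ** 2 / (n_a - 1) + var_b ** 2 / (n_b - 1))
    return t, df


# Illustrative comparison of two table rows (assumed pairing, see lead-in).
t_stat, dof = welch_t(1617.0, 109.0, 4, 1381.0, 35.0, 5)
print(f"t = {t_stat:.2f}, approximate df = {dof:.1f}")
```

A two-sided p-value would then be read from the t distribution with the computed degrees of freedom, for example from a statistical table or library.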






Sequence Listing 

<210> 1 
<211> 1986 
<212> genomic DNA 
<213> Saccharomyces cerevisiae 

<221> CDS 
<222> (1) . . (1983) 

<400> 1 

atg ggc aca ctg ttt cga aga aat gcc cag aac caa aag agt gat tec 
Met Gly Thr Leu Phe Arg Arg Asn Val Gin Asn Gin Lys Ser Asp Ser 
15 10 15 

gat gaa aac aat aaa ggg ggt tct gtt cat aac aag cga gag age aga 96 
Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 
20 25 30 

aac cac act cat cat caa cag gga tta ggc cat aag aga aga agg ggt 144 
Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45 

att agt ggc agt gca aaa aga aat gag cgt ggc aaa gat ttc gac agg 192 
Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

aaa aga gac ggg aac ggt aga aaa cgt tgg aga gat tec aga aga ctg 240 
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
65 70 75 80 

att ttc att ctt ggt gca ttc tta ggt gta ctt ttg ccg ttt age ttt 288 
Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95 

ggc get tat cat gtt cat aat age gat age gac ttg ttt gac aac ttt 336
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 105 110 

gta aat ttt gat tea ctt aaa gtg tat ttg gat gat tgg aaa gat gtt 384 
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 125 

etc cca caa ggt aca age teg ttt att gat gat att cag get ggt aac 432 
Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

cac tec aca tct tct tta gat gat etc agt gaa aat ttt gcc gtt ggt 480
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

aaa caa etc tta cgt gat tat aat ate gag gee aaa cat cet gtt gta 528 
Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175 

atg gtt cet ggt gtc att tct acg gga att gaa age tgg gga get att 576 
Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190 

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa egg ctg tgg 624 
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
195 200 205 

gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 672 
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 215 220 

teg aaa cat gta atg tta gat cet gaa aca ggt ctg gac cca ccg aac 720 
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 240 

ttt acg eta cgt gea gca cag gge ttc gaa tea act gat tat ttc ate 768 
Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255 

gca ggg tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816 
Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270 

gge tat gaa ccc aat aaa atg acg agt get gcg tat gat tgg agg ett 864 
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 285 

gca tat tta gat eta gaa aga egc gat agg tac ttt acg aag eta aag 912 
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 295 300 

gaa caa ate gaa ctg ttt cat caa ttg agt ggt gaa aaa gtt tgt tta 960 
Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320 

att gga cat tct atg ggt tct cag att ate ttt tac ttt atg aaa tgg 1008 
Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

gtc gag get gaa ggc cct ctt tac ggt aac ggt ggt cgt ggc tgg get 1056
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350



aac gaa cac ata gat tea ttc att aat gca gca ggg acg ett ctg ggc 1104 
Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365 



get cea aag gca gtt cca get eta att agt ggt gaa atg aaa gat ace 1152 
Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380 

att caa tta aat aeg tta gcc atg tat ggt ttg gaa aag tte ttc tea 1200 
Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400 



aga att gag aga gta aaa atg tta caa aeg tgg ggt ggt ata cca tea 1248 
Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415 



atg eta cea aag gga gaa gag gtc att tgg ggg gat atg aag tea tct 1296 
Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430 



tea gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344 
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445 



cga ttr gaa agg aat acg age gat get ttc aac aaa aat ttg aca atg 1392 
Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 
450 455 460 

aaa gac gcc att aac atg aca tta teg ata tea cct gaa tgg etc caa 1440 
Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480 



aga aga gta cat gag cag tac teg tte ggc tat tee aag aat gaa gaa 1488 
Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495 



gag tta aga aaa aat gag eta cac cac aag cac tgg teg aat cea atg 1536
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 
500 505 510 



gaa gta cca ctt cca gaa get ecc cac atg aaa ate tat tgt ata tac 1584 

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

ggg gtg aac aac cca act gaa agg gca tat gta cat aag gaa gag gat 1632

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 

530 535 540 

gac tec tct get ctg aat ttg ace ate gac tac gaa age aag caa cet 1680 
Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560 

gta ttc etc acc gag ggg gac gga ace gtt ccg etc gtg geg eat tea 1728 
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 
565 570 575 

atg tgt cac aaa tgg gee eag ggt get tea ccg tac aac cct gcc gga 1776 
Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590 

act aac gtt act att gtg gaa atg aaa cac eag cca gat cga ttt gat 1824 
Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605 

ata cgt ggt gga gca aaa age gee gaa cac gta gac ate etc ggc age 1872 
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620 

geg gag ttg aac gat tac ate ttg aaa att gca age ggt aat ggc gat 1920 
Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640 

etc gtc gag eca ege caa ttg tct aat ttg age eag tgg gtt tet eag 1968 
Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655 

atg ecc ttc eca atg taa 1986 
Met Pro Phe Pro Met 
660 

<210> 2
<211> 661 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 2 

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met



660 
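
The header fields of SEQ ID NO: 1 and SEQ ID NO: 2 can be cross-checked with simple arithmetic. The sketch below uses only the numbers given in the <211> and <222> fields of the two entries; the variable names are illustrative and not part of the listing.

```python
# Values copied from the headers of SEQ ID NO: 1 and SEQ ID NO: 2.
seq1_length_nt = 1986          # <211> of SEQ ID NO: 1
cds_start, cds_end = 1, 1983   # <222> of SEQ ID NO: 1
seq2_length_aa = 661           # <211> of SEQ ID NO: 2

# Length of the coding region and the number of whole codons it contains.
cds_length_nt = cds_end - cds_start + 1
codons, remainder = divmod(cds_length_nt, 3)

# Nucleotides of SEQ ID NO: 1 that lie after the annotated CDS.
trailing_nt = seq1_length_nt - cds_end

print(codons, remainder, trailing_nt)
```

The codon count agrees with the stated length of SEQ ID NO: 2, and the nucleotides left over after the CDS correspond to the final codon printed without an amino acid at the end of SEQ ID NO: 1.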

<210> 3

<211> 2312 

<212> genomic DNA 

<213> Schizosaccharomyces pombe 



<400> 3 

ATGGCGTCTT CCAAGAAGAG CAAAACTCAT AAGAAAAAGA AAGAAGTCAA 50 

ATCTCCTATC GACTTACCAA ATTCAAAGAA ACCAACTCGC GCTTTGAGTG 100 

AGCAACCTTC AGCGTCCGAA ACACAATCTG TTTCAAATAA ATCAAGAAAA 150 

TCTAAATTTG GAAAAAGATT GAATTTTATA TTGGGCGCTA TTTTGGGAAT 200 

ATGCGGTGCT TTTTTTTTCG CTGTTGGAGA CGACAATGCT GTTTTCGACC 250 

CTGCTACGTT AGATAAATTT GGGAATATGC TAGGCTCTTC AGACTTGTTT 300 

GATGACATTA AAGGATATTT ATCTTATAAT GTGTTTAAGG ATGCACCTTT 350 

TACTACGGAC AAGCCTTCGC AGTCTCCTAG CGGAAATGAA GTTCAAGTTG 400 

GTCTTGATAT GTACAATGAG GGATATCGAA GTGACCATCC TGTTATTATG 450

GTTCCTGGTG TTATCAGCTC AGGATTAGAA AGTTGGTCGT TTAATAATTG 500 

CTCGATTCCT TACTTTAGGA AACGTCTTTG GGGTAGCTGG TCTATGCTGA 550 

AGGCAATGTT CCTTGACAAG CAATGCTGGC TTGAACATTT AATGCTTGAT 600 

AAAAAAACCG GCTTGGATCC GAAGGGAATT AAGCTGCGAG CAGCTCAGGG 650 

GTTTGAAGCA GCTGATTTTT TTATCACGGG CTATTGGATT TGGAGTAAAG 700

TAATTGAAAA CCTTGCTGCA ATTGGTTATG AGCCTAATAA CATGTTAAGT 750 

GCTTCTTACG ATTGGCGGTT ATCATATGCA AATTTAGAGG AACGTGATAA 800 

ATATTTTTCA AAGTTAAAAA TGTTCATTGA GTACAGCAAC ATTGTACATA 850 

AGAAAAAGGT AGTGTTGATT TCTCACTCCA TGGGTTCACA GGTTACGTAC 900 

TATTTTTTTA AGTGGGTTGA AGCTGAGGGC TACGGAAATG GTGGACCGAC 950 

TTGGGTTAAT GATCATATTG AAGCATTTAT AAATGTGAGT CTCGATGGTT 1000 
GTTTGACTAC GTTTCTAACT TTTGAATAGA TATCGGGATC TTTGATTGGA 1050 

GCACCCAAAA CAGTGGCAGC GCTTTTATCG GGTGAAATGA AAGATACAGG 1100 

TATTGTAATT ACATTAAACA TGTTAATATT TAATTTTTGC TAACCGTTTT 1150 

AAGCTCAATT GAATCAGTTT TCGGTCTATG GGTAAGCAAT AAATTGTTGA 1200 

GATTTGTTAC TAATTTACTG TTTAGTTTGG AAAAATTTTT TTCCCGTTCT 1250 

GAGGTATATT CAAAAATACA AATGTGCTCT ACTTTTTCTA ACTTTTAATA 1300 

GAGAGCCATG ATGGTTCGCA CTATGGGAGG AGTTAGTTCT ATGCTTCCTA 1350 

AAGGAGGCGA TGTTGTATGG GGAAATGCCA GTTGGGTAAG AAATATGTGC 1400 

TGTTAATTTT TTATTAATAT TTAGGCTCCA GATGATCTTA ATCAAACAAA 1450 

TTTTTCCAAT GGTGCAATTA TTCGATATAG AGAAGACATT GATAAGGACC 1500 

ACGATGAATT TGACATAGAT GATGCATTAC AATTTTTAAA AAATGTTACA 1550 

GATGACGATT TTAAAGTCAT GCTAGCGAAA AATTATTCCC ACGGTCTTGC 1600 
TTGGACTGAA AAAGAAGTGT TAAAAAATAA CGAAATGCCG TCTAAATGGA 1650 

TAAATCCGCT AGAAGTAAGA ACATTAAAGT TACTAAATTA TACTAACCCA 1700 

AATAGACTAG TCTTCCTTAT GCTCCTGATA TGAAAATTTA TTGCGTTCAC 1750 

GGGGTCGGAA AACCAACTGA GAGAGGTTAT TATTATACTA ATAATCCTGA 1800 

GGGGCAACCT GTCATTGATT CCTCGGTTAA TGATGGAACA AAAGTTGAAA 1850 

ATGTGAGAGA ATTTATGTTT CAAACATTCT ATTAACTGTT TTATTAGGGT 1900 

ATTGTTATGG ATGATGGTGA TGGAACTTTA CCAATATTAG CCCTTGGTTT 1950 

GGTGTGCAAT AAAGTTTGGC AAACAAAAAG GTTTAATCCT GCTAATACAA 2000 

GTATCACAAA TTATGAAATC AAGCATGAAC CTGCTGCGTT TGATCTGAGA 2050 

GGAGGACCTC GCTCGGCAGA ACACGTCGAT ATACTTGGAC ATTCAGAGCT 2100 
AAATGTATGT TCATTTTACC TTACAAATTT CTATTACTAA CTCTTGAAAT 2150 
AAGGAAATTA TTTTAAAAGT TTCATCAGGC CATGGTGACT CGGTACCAAA 2200 
CCGTTATATA TCAGATATCC AGTACGGACA TAAGTTTTGT AGATTGCAAT 2250 
TAACTAACTA ACCGAACAGG GAAATAATAA ATGAGATAAA TCTCGATAAA 2300 
CCTAGAAATT AA 2312 

<210> 4

<211> 3685 

<212> genomic DNA 

<213> Arabidopsis thaliana 

<400> 4 

ATGCCCCTTA TTCATCGGAA AAAGCCGACG GAGAAACCAT CGACGCCGCC 
ATCTGAAGAG GTGGTGCACG ATGAGGATTC GCAAAAGAAA CCACACGAAT
CTTCCAAATC CCACCATAAG AAATCGAACG GAGGAGGGAA GTGGTCGTGC 
ATCGATTCTT GTTGTTGGTT CATTGGGTGT GTGTGTGTAA CCTGGTGGTT 
TCTTCTCTTC CTTTACAACG CAATGCCTGC GAGCTTCCCT CAGTATGTAA 
CGGAGCGAAT CACGGGTCCT TTGCCTGACC CGCCCGGTGT TAAGCTCAAA 
AAAGAAGGTC TTAAGGCGAA ACATCCTGTT GTCTTCATTC CTGGGATTGT 
CACCGGTGGG CTCGAGCTTT GGGAAGGCAA ACAATGCGCT GATGGTTTAT 
TTAGAAAACG TTTGTGGGGT GGAACTTTTG GTGAAGTCTA CAAAAGGTGA 
GCTCAACAAT TCTCACTCTT CCTTTATATT GGGATTTGGA TTGGATCTGA 
TGAGATCACG CACTTGTTGC TTCTTCAACA TCACTCAAAC TTTAATTCCA 
TGTTTGTCTG TCTTACTCTT TACTTTTTTT TTTTTTTGAT GTGAAACGCT 
ATTTTCTTAA GAGACTATTT CTGTATGTGT AAGGTAAGCG TTCCAAGGAC 
GTAATTGGCT TGGACTATTT CTGTTTGATT GTTAACTTTA GGATATAAAA 
TAGCTGCCTT GGAATTTCAA GTCATCTTAT TGCCAAATCT GTTGCTAGAC 
ATGCCCTAGA GTCCGTTCAT AACAAGTTAC TTCCTTTACT GTCGTTGCGT 
GTAGATTTAG CTTTGTGTAG CGTATAATGA AGTAGTGTTT TATGTTTTGT 
TGGGAATAGA GAAGTTCTAA CTACATCTGT GGAAAGTGTG TTCAGGCTGT 
GATAGAGGAC TGTTGCTTTA TTATTCAACT ATGTATATGT GTAATTAAAG 
CTAGTTCCTT TTTGATCTTT CAGCTCAATG TGCTTTTCTC AATTTTTTTC 
TCAATTTCAA AGTTTCACAT CGAGTTTATT CACATGTCTT GAATTTCGTC 
CATCCTCGTT CTGTTATCCA GCTTTGAACT CCTCCCGACC CTGCTATGGA 
TATATTAAAA AAAAAGTGTT TTGTGGGTTG CATCTTTGTT ACGATCTGCA 
TCTTCTTCTT TCGGCTCAGT GTTCATGTTT TTGCTATGGT AGAGATGGGC 
AATGTTATTG TTGATGGTAA CAGTGGTATA GTTGATAGTA TCTTAACTAA 
TCAATTATCT CTTTGATTCA GGCCTCTATG TTGGGTGGAA CACATGTCAC 
TTGACAATGA AACTGGGTTG GATCCAGCTG GTATTAGAGT TCGAGCTGTA 
TCAGGACTCG TGGCTGCTGA CTACTTTGCT CCTGGCTACT TTGTCTGGGC 
AGTGCTGATT GCTAACCTTG CACATATTGG ATATGAAGAG AAAAATATGT 
ACATGGCTGC ATATGACTGG CGGCTTTCGT TTCAGAACAC AGAGGTTCTT 
TTCTCATCGT TCTTTCTATT ATTCTGTTCC ATGTTACGTT TCTTTCTTCA 
TTACTTAAGG CTTAAATATG TTTCATGTTG AATTAATAGG TACGTGATCA 
GACTCTTAGC CGTATGAAAA GTAATATAGA GTTGATGGTT TCTACCAACG 
GTGGAAAAAA AGCAGTTATA GTTCCGCATT CCATGGGGGT CTTGTATTTT 
CTACATTTTA TGAAGTGGGT TGAGGCACCA GCTCCTCTGG GTGGCGGGGG 
TGGGCCAGAT TGGTGTGCAA AGTATATTAA GGCGGTGATG AACATTGGTG 
GACCATTTCT TGGTGTTCCA AAAGCTGTTG CAGGGCTTTT CTCTGCTGAA 
GCAAAGGATG TTGCAGTTGC CAGGTATTGA ATATCTGCTT ATACTTTTGA 
TGATCAGAAC CTTGGCTCTG GAACTCAAAG TTATTCTACT AAATATCAAT 
TCTAATAACA TTGCTATATT ATCGCTGCAA CTGACATTGG TTGATTATTT 
TTGCTGCTTA TGTAACTGAA ACTCTCTTGA GATTAGACAA ATGATGAATT 
GATAATTCTT ACGCATTGCT CTGTGATGAC CAGTTTCTTA GCTTCGACGA 
TAACATTTGT CATACTGTCT TTTGGAGGGC ATTGAATTTT GCTATGGAAA 
GCGCTGGAGC TTCCATGCTT GCATTCTTTA CCAATTAGCG TTATTCTGCT 
TCTTTCAATT TTCTTGTATA TGCATCTATG GTCTTTTATT TCTTCTTAAT 
TAAAGACTCG TTGGATTAGT TGCTCTATTA GTCACTTGGT TCCTTAATAT 
AGAACTTTAC TTTCTTCGAA AATTGCAGAG CGATTGCCCC AGGATTCTTA 
GACACCGATA TATTTAGACT TCAGACCTTG CAGCATGTAA TGAGAATGAC 
ACGCACATGG GACTCAACAA TGTCTATGTT ACCGAAGGGA GGTGACACGA 
TATGGGGCGG GCTTGATTGG TCACCGGAGA AAGGCCACAC CTGTTGTGGG 
AAAAAGCAAA AGAACAACGA AACTTGTGGT GAAGCAGGTG AAAACGGAGT 
TTCCAAGAAA AGTCCTGTTA ACTATGGAAG GATGATATCT TTTGGGAAAG 
AAGTAGCAGA GGCTGCGCCA TCTGAGATTA ATAATATTGA TTTTCGAGTA 
AGGACATATA AATCATAATA AACCTTGTAC ATTTTGTGAT TGTATGATGA 
ATATCTGTAC ATTTTATCTG GTGAAGGGTG CTGTCAAAGG TCAGAGTATC 
CCAAATCACA CCTGTCGTGA CGTGTGGACA GAGTACCATG ACATGGGAAT 
TGCTGGGATC AAAGCTATCG CTGAGTATAA GGTCTACACT GCTGGTGAAG 
CTATAGATCT ACTACATTAT GTTGCTCCTA AGATGATGGC GCGTGGTGCC 2900

GCTCATTTCT CTTATGGAAT TGCTGATGAT TTGGATGACA CCAAGTATCA 2950 

AGATCCCAAA TACTGGTCAA ATCCGTTAGA GACAAAGTAA GTGATTTCTT 3000 

GATTCCAACT GTATCCTTCG TCCTGATGCA TTATCAGTCT TTTTGTTTTC 3050 

GGTCTTGTTG GATATGGTTT TCAGCTCAAA GCTTACAAAG CTGTTTCTGA 3100 

GCCTTTCTCA AAAAGGCTTG CTCAGTAATA TTGAGGTGCT AAAGTTGATA 3150 

CATGTGACTC TTGCTTATAA ATCCTCCGTT TGGTTTGTTC TGCTTTTTCA 3200 

GATTACCGAA TGCTCCTGAG ATGGAAATCT ACTCATTATA CGGAGTGGGG 3250 
ATACCAACGG AACGAGCATA CGTATACAAG CTTAACCAGT CTCCCGACAG 3300 

TTGCATCCCC TTTCAGATAT TCACTTCTGC TCACGAGGAG GACGAAGATA 3350 

GCTGTCTGAA AGCAGGAGTT TACAATGTGG ATGGGGATGA AACAGTACCC 3400 

GTCCTAAGTG CCGGGTACAT GTGTGCAAAA GCGTGGCGTG GCAAGACAAG 3450 

ATTCAACCCT TCCGGAATCA AGACTTATAT AAGAGAATAC AATCACTCTC 3500 

CGCCGGCTAA CCTGTTGGAA GGGCGCGGGA CGCAGAGTGG TGCCCATGTT 3550 

GATATCATGG GAAACTTTGC TTTGATCGAA GATATCATGA GGGTTGCCGC 3600 

CGGAGGTAAC GGGTCTGATA TAGGACATGA CCAGGTCCAC TCTGGCATAT 3650
TTGAATGGTC GGAGCGTATT GACCTGAAGC TGTGA 3685 

<210> 5
<211> 2427 
<212> cDNA 

<213> Arabidopsis thaliana 



<400> 5 

AGAAACAGCT CTTTGTCTCT CTCGACTGAT CTAACAATCC CTAATCTGTG 50 

TTCTAAATTC CTGGACGAGA TTTGACAAAG TCCGTATAGC TTAACCTGGT 100 

TTAATTTCAA GTGACAGATA TGCCCCTTAT TCATCGGAAA AAGCCGACGG 150 

AGAAACCATC GACGCCGCCA TCTGAAGAGG TGGTGCACGA TGAGGATTCG 200 

CAAAAGAAAC CACACGAATC TTCCAAATCC CACCATAAGA AATCGAACGG 250 

AGGAGGGAAG TGGTCGTGCA TCGATTCTTG TTGTTGGTTC ATTGGGTGTG 300 

TGTGTGTAAC CTGGTGGTTT CTTCTCTTCC TTTACAACGC AATGCCTGCG 350 

AGCTTCCCTC AGTATGTAAC GGAGCGAATC ACGGGTCCTT TGCCTGACCC 400 
GCCCGGTGTT AAGCTCAAAA AAAGAAGGTC TTAAGGCGAA ACATCCTGTT 450 

GTCTTCATTC CTGGGATTGT CACCGGTGGG CTCGAGCTTT GGGAAGGCAA 500 

ACAATGCGCT GATGGTTTAT TTAGAAAACG TTTGTGGGGT GGAACTTTTG 550 

GTGAAGTCTA CAAAAGGCCT CTATGTTGGG TGGAACACAT GTCACTTGAC 600 

AATGAAACTG GGTTGGATCC AGCTGGTATT AGAGTTCGAG CTGTATCAGG 650 

ACTCGTGGCT GCTGACTACT TTGCTCCTGG CTACTTTGTC TGGGCAGTGC 700 

TGATTGCTAA CCTTGCACAT ATTGGATATG AAGAGAAAAA TATGTACATG 750 

GCTGCATATG ACTGGCGGCT TTCGTTTCAG AACACAGAGG TACGTGATCA 800 

GACTCTTAGC CGTATGAAAA GTAATATAGA GTTGATGGTT TCTACCAACG 850 

GTGGAAAAAA AGCAGTTATA GTTCCGCATT CCATGGGGGT CTTGTATTTT 900 

CTACATTTTA TGAAGTGGGT TGAGGCACCA GCTCCTCTGG GTGGCGGGGG 950 
TGGGCCAGAT TGGTGTGCAA AGTATATTAA GGCGGTGATG AACATTGGTG 1000 

GACCATTTCT TGGTGTTCCA AAAGCTGTTG CAGGGCTTTT CTCTGCTGAA 1050 

GCAAAGGATG TTGCAGTTGC CAGAGCGATT GCCCCAGGAT TCTTAGACAC 1100 

CGATATATTT AGACTTCAGA CCTTGCAGCA TGTAATGAGA ATGACACGCA 1150 

CATGGGACTC AACAATGTCT ATGTTACCGA AGGGAGGTGA CACGATATGG 1200 

GGCGGGCTTG ATTGGTCACC GGAGAAAGGC CACACCTGTT GTGGGAAAAA 1250 

GCAAAAGAAC AACGAAACTT GTGGTGAAGC AGGTGAAAAC GGAGTTTCCA 1300 

AGAAAAGTCC TGTTAACTAT GGAAGGATGA TATCTTTTGG GAAAGAAGTA 1350 

GCAGAGGCTG CGCCATCTGA GATTAATAAT ATTGATTTTC GAGGTGCTGT 1400 

CAAAGGTCAG AGTATCCCAA ATCACACCTG TCGTGACGTG TGGACAGAGT 1450 

ACCATGACAT GGGAATTGCT GGGATCAAAG CTATCGCTGA GTATAAGGTC 1500

TACACTGCTG GTGAAGCTAT AGATCTACTA CATTATGTTG CTCCTAAGAT 1550 

GATGGCGCGT GGTGCCGCTC ATTTCTCTTA TGGAATTGCT GATGATTTGG 1600 

ATGACACCAA GTATCAAGAT CCCAAATACT GGTCAAATCC GTTAGAGACA 1650 

AAATTACCGA ATGCTCCTGA GATGGAAATC TACTCATTAT ACGGAGTGGG 1700 

GATACCAACG GAACGAGCAT ACGTATACAA GCTTAACCAG TCTCCCGACA 1750 

GTTGCATCCC CTTTCAGATA TTCACTTCTG CTCACGAGGA GGACGAAGAT 1800 

AGCTGTCTGA AAGCAGGAGT TTACAATGTG GATGGGGATG AAACAGTACC 1850 

CGTCCTAAGT GCCGGGTACA TGTGTGCAAA AGCGTGGCGT GGCAAGACAA 1900 

GATTCAACCC TTCCGGAATC AAGACTTATA TAAGAGAATA CAATCACTCT 1950 

CCGCCGGCTA ACCTGTTGGA AGGGCGCGGG ACGCAGAGTG GTGCCCATGT 2000 

TGATATCATG GGAAACTTTG CTTTGATCGA AGATATCATG AGGGTTGCCG 2050 

CCGGAGGTAA CGGGTCTGAT ATAGGACATG ACCAGGTCCA CTCTGGCATA 2100 

TTTGAATGGT CGGAGCGTAT TGACCTGAAG CTGTGAATAT CATGATCTCT 2150 

TTAAGCTGTC CTGTCAGCTT ATGTGAATCC AATACTTTGA AAGAGAGATC 2200 

ATCATCAATT CATCATCATC GTCATCATCA TGATGCTCAA CTCACAAAGA 2250 

AGCCTGAGAA TGATACTTTG GTgCGAAATT CTCAATACCT CTTTAATATT 2300 

CTTATTGAAT GTAAATTATA CAATCCTATC TAATGTTTGA ACGATAACAC 2350 

AAAACTTGCT GCNGCCATGT TTGTTTGTCT TGTCAAAAGC ATCAATTTGT 2400 
GGGTTAAAAA AAAAAAAAAA AAAAAAA 2427 
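
Comparing the genomic sequence (SEQ ID NO: 4) with the cDNA (SEQ ID NO: 5) shows where an intervening stretch is absent from the cDNA. The minimal sketch below uses three short excerpts copied from the two listings and searches for split points that are consistent with a GT...AG intron boundary. The function names are hypothetical, and the GT-AG rule is a general convention assumed here; it is not stated in the listing itself.

```python
def common_prefix_len(a, b):
    """Length of the longest common prefix of strings a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def gt_ag_split_points(upstream, downstream, cdna):
    """Return cDNA offsets k such that cdna[:k] matches the start of
    `upstream`, cdna[k:] matches the end of `downstream`, and the implied
    intron begins with GT and ends with AG."""
    p = common_prefix_len(upstream, cdna)
    s = common_prefix_len(downstream[::-1], cdna[::-1])
    hits = []
    for k in range(max(1, len(cdna) - s), min(p, len(cdna) - 1) + 1):
        intron_start = upstream[k:k + 2]
        intron_tail = downstream[:len(downstream) - (len(cdna) - k)]
        if intron_start == "GT" and intron_tail.endswith("AG"):
            hits.append(k)
    return hits


# Excerpts copied from SEQ ID NO: 4 (genomic) and SEQ ID NO: 5 (cDNA).
genomic_upstream = "GTGAAGTCTACAAAAGGTGAGCTCAACAAT"
genomic_downstream = "CTTTGATTCAGGCCTCTATGTTGGG"
cdna_excerpt = "GTGAAGTCTACAAAAGGCCTCTATGTTGGG"

splits = gt_ag_split_points(genomic_upstream, genomic_downstream, cdna_excerpt)
for k in splits:
    print(cdna_excerpt[:k], "|", cdna_excerpt[k:])
```

Because the base on either side of the junction is the same here, sequence identity alone leaves several possible split points; requiring GT at the start and AG at the end of the removed stretch narrows them down.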

<210> 6
<211> 671 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 6 

MPLIHRKKPT EKPSTPPSEE VVHDEDSQKK PHESSKSHHK KSNGGGKWSC 50

IDSCCWFIGC VCVTWWFLLF LYNAMPASFP QYVTERITGP LPDPPGVKLK 100

KEGLKAKHPV VFIPGIVTGG LELWEGKQCA DGLFRKRLWG GTFGEVYKRP 150

LCWVEHMSLD NETGLDPAGI RVRAVSGLVA ADYFAPGYFV WAVLIANLAH 200

IGYEEKNMYM AAYDWRLSFQ NTEVRDQTLS RMKSNIELMV STNGGKKAVI 250

VPHSMGVLYF LHFMKWVEAP APLGGGGGPD WCAKYIKAVM NIGGPFLGVP 300

KAVAGLFSAE AKDVAVARAI APGFLDTDIF RLQTLQHVMR MTRTWDSTMS 350 

MLPKGGDTIW GGLDWSPEKG HTCCGKKQKN NETCGEAGEN GVSKKSPVNY 400 

GRMISFGKEV AEAAPSEINN IDFRGAVKGQ SIPNHTCRDV WTEYHDMGIA 450 

GIKAIAEYKV YTAGEAIDLL HYVAPKMMAR GAAHFSYGIA DDLDDTKYQD 500 

PKYWSNPLET KLPNAPEMEI YSLYGVGIPT ERAYVYKLNQ SPDSCIPFQI 550 

FTSAHEEDED SCLKAGVYNV DGDETVPVLS AGYMCAKAWR GKTRFNPSGI 600 

KTYIREYNHS PPANLLEGRG TQSGAHVDIM GNFALIEDIM RVAAGGNGSD 650 
IGHDQVHSGI FEWSERIDLK L 671 
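
The relationship between the nucleotide and protein entries can be illustrated by translating the start of the open reading frame. The sketch below assumes the standard genetic code and uses the first 30 nucleotides of SEQ ID NO: 4 as printed in the listing; `translate` and `CODON_TABLE` are hypothetical helper names.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Spaces (as used between 10-mers in the listing) are ignored.
    Ambiguity codes such as N are not handled and raise a KeyError.
    """
    seq = nucleotides.upper().replace(" ", "")
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of SEQ ID NO: 4 as printed in the listing.
orf_start = "ATGCCCCTTA TTCATCGGAA AAAGCCGACG"
print(translate(orf_start))
```

The translated string agrees with the first ten residues of SEQ ID NO: 6 (MPLIHRKKPT).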

<210> 7
<211> 643 
<212> cDNA 
<213> Zea mays 

<221> CDS
<222> (1)..(402)

<400> 7

CGG GAG AAA ATA GCT GCT TTG AAG GGG GGT GTT TAG TTA GCC GAT GGT 48 

Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

GAT GAA ACT GTT CCA GTT CTT AGT GCG GGC TAC ATG TGT GCG AAA GGA 96
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

TGG CGT GGC AAA ACT CGT TTC AGC CCT GCC GGC AGC AAG ACT TAC GTG 144
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

AGA GAA TAC AGC CAT TCG CCA CCC TCT ACT CTC CTG GAA GGC AGG GGC 192
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

ACC CAG AGC GGT GCA CAT GTT GAT ATA ATG GGG AAC TTT GCT CTA ATT 240
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

GAG GAC GTC ATC AGA ATA GCT GCT GGG GCA ACC GGT GAG GAA ATT GGT 288
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

GGC GAT CAG GTT TAT TCA GAT ATA TTC AAG TGG TCA GAG AAA ATC AAA 336
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

TTG AAA TTG TAA CCTATGGGAA GTTAAAGAAG TGCCGACCCG TTTATTGCGTTCC 391 
Leu Lys Leu 
115 

AAAGTGTCCT GCCTGAGTGC AACTCTGGAT TTTGCTTAAA TATTGTAATT TTTCACGC 449 
TTCATTCGTC CCTTTGTCAA ATTTACATTT GACAGGACGC CAATGCGATA CGATGTTG 507 
TACCGCTATT TTCAGCATTG TATATTAAAC TGTACAGGTG TAAGTTGCAT TTGCCAGC 565 
TGAAATTGTG TAGTCGTTTT CTTTACGATT TAATANCAAG TGGCGGAGCA GTGCCCCA 623 
AGCNAAAAAA AAAAAAAAAA 643 

<210> 8

<211> 115 
<212> PRT 
<213> Zea mays 

<400> 8 

Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

Leu Lys Leu
115

<210> 9

<211> 616 
<212> cDNA 

<213> Neurospora crassa 
<400> 9 

ggcggcgaag acganggcgg aagttggagg ctaacgagaa tgacnctcgg 50 

agatggatct accctccaga gacacgacta ccnttgcacc cagccccaag 100 

gtntacngtt tntacgggta ggaagccgac ggagcgagcc tacatctatc 150 

tggcgcccga tcccgggacg acaacgcatc tttagatgac gatcgatacg 200 

actttgactn aggggcacat tgaccacggt gtgattttgg gcgaaggcga 250 

tggcacagcg aaccctatga gtttggggta cctgcgcaat aaggggtgga 300

aaatgaagag atacaaccct gcgggctcaa aaataaccgt ggccgagatg 350 

ccgcatgaac cagaacggtt caatccgaga ggagggccga atacggcgga 400 

tcacgtggac attctaggaa ggcagaatct aaacgagtac attcttaaag 450 

tggcggcagg tcgaggcgac acaattgagg atcttattac tagtaatatc 500 

cttaaacarg tagaaaaggt tgaaatttat gaagagtaat taaatacggc 550 

acataggtta cncaatagta tgactaatta aaaaaaaatt ttttttccaa 600
aaaaaaaaaa aaaaaa 616 

<210> 10

<211> 1562 

<212> genomic DNA 

<213> Arabidopsis thaliana 

<400> 10 

ATGAAAAAAA TATCTTCACA TTATTCGGTA GTCATAGCGA TACTCGTTGT 50
GGTGACGATG ACCTCGATGT GTCAAGCTGT GGGTAGCAAC GTGTACCCTT 100
TGATTCTGGT TCCAGGAAAC GGAGGTAACC AGCTAGAGGT ACGGCTGGAC 150
AGAGAATACA AGCCAAGTAG TGTCTGGTGT AGCAGCTGGT TATATCCGAT 200
TCATAAGAAG AGTGGTGGAT GGTTTAGGCT ATGGTTCGAT GCAGCAGTGT 250
TATTGTCTCC CTTCACCAGG TGCTTCAGCG ATCGAATGAT GTTGTACTAT 300
GACCCTGATT TGGATGATTA CCAAAATGCT CCTGGTGTCC AAACCCGGGT 350
TCCTCATTTC GGTTCGACCA AATCACTTCT ATACCTCGAC CCTCGTCTCC 400
GGTTAGTACT TTCCAAGATA TATCATTTTG GGACATTTGC ATAATGAACA 450
AAATAGACAT AAATTTGGGG GATTATTGTT ATATCAATAT CCATTTATAT 500
GCTAGTCGGT AATGTGAGTG TTATGTTAGT ATAGTTAATG TGAGTGTTAT 550
GTGATTTTCC ATTTTAAATG AAGCTAGAAA GTTGTCGTTT AATAATGTTG 600
CTATGTCATG AGAATTATAA GGACACTATG TAAATGTAGC TTAATAATAA 650
GGTTTGATTT GCAGAGATGC CACATCTTAC ATGGAACATT TGGTGAAAGC 700
TCTAGAGAAA AAATGCGGGT ATGTTAACGA CCAAACCATC CTAGGAGCTC 750
CATATGATTT CAGGTACGGC CTGGCTGCTT CGGGCCACCC GTCCCGTGTA 800
GCCTCACAGT TCCTACAAGA CCTCAAACAA TTGGTGGAAA AAACTAGCAG 850
CGAGAACGAA GGAAAGCCAG TGATACTCCT CTCCCATAGC CTAGGAGGAC 900
TTTTCGTCCT CCATTTCCTC AACCGTACCA CCCCTTCATG GCGCCGCAAG 950
TACATCAAAC ACTTTGTTGC ACTCGCTGCG CCATGGGGTG GGACGATCTC 1000
TCAGATGAAG ACATTTGCTT CTGGCAACAC ACTCGGTGTC CCTTTAGTTA 1050
ACCCTTTGCT GGTCAGACGG CATCAGAGGA CCTCCGAGAG TAACCAATGG 1100
CTACTTCCAT CTACCAAAGT GTTTCACGAC AGAACTAAAC CGCTTGTCGT 1150
AACTCCCCAG GTTAACTACA CAGCTTACGA GATGGATCGG TTTTTTGCAG 1200
ACATTGGATT CTCACAAGGA GTTGTGCCTT ACAAGACAAG AGTGTTGCCT 1250
TTAACAGAGG AGCTGATGAC TCCGGGAGTG CCAGTCACTT GCATATATGG 1300
GAGAGGAGTT GATACACCGG AGGTTTTGAT GTATGGAAAA GGAGGATTCG 1350
ATAAGCAACC AGAGATTAAG TATGGAGATG GAGATGGGAC GGTTAATTTG 1400
GCGAGCTTAG CAGCTTTGAA AGTCGATAGC TTGAACACCG TAGAGATTGA 1450
TGGAGTTTCG CATACATCTA TACTTAAAGA CGAGATCGCA CTTAAAGAGA 1500
TTATGAAGCA GATTTCAATT ATTAATTATG AATTAGCCAA TGTTAATGCC 1550
GTCAATGAAT GA 1562






<210> 11

<211> 3896 

<212> genomic DNA 

<213> Arabidopsis thaliana 



<400> 11 

ATGGGAGCGA ATTCGAAATC AGTAACGGCT TCCTTCACCG TCATCGCCGT 50 

TTTTTTCTTG ATTTGCGGTG GCCGAACTGC GGTGGAGGAT GAGACCGAGT 100 

TTCACGGCGA CTACTCGAAG CTATCGGGTA TAATCATTCC GGGATTTGCG 150 

TCGACGCAGC TACGAGCGTG GTCGATCCTT GACTGTCCAT ACACTCCGTT 200 

GGACTTCAAT CCGCTCGACC TCGTATGGCT AGACACCACT AAGGTCCGTG 250 

ATCTTCATTT CCTTCGCTCC TTATTCTGTC GGTCGAGTCA CTTGTTGATG 300 

AATTCCAAGC GAAATATAGC AATGAAGCAT GTCTCGTCTC TCTTATTGAT 350 

TCGTTCATTA GTCAACAGTG ACGCTTCTGA ATCTGAGTTT AGAGTCATAT 400 

AAAACAGCTG ACTCGGCGAG TGTTTCCCAT CGCTTTTGGT TCGCTAAATG 450 

TAGCGCAATG AATGTGTAAT TAGTCTGCGC TTTTTATTCA ACTAGATCTG 500 

CAAGTTTTTC AGAGTGCTCA ATAGTAGTTA GAAAATGTTA GGTCATTTTA 550 

CTTGTGCATT GTGATTCTTT TGGTTGTTGC TTACTGATCG ACGTGATGGA 600 

TGGTTTACAG CTTCTTTCTG CTGTCAACTG CTGGTTTAAG TGTATGGTGC 650 

TAGATCCTTA TAATCAAACA GACCATCCCG AGTGTAAGTC ACGGCCTGAC 700 

AGTGGTCTTT CAGCCATCAC AGAATTGGAT CCAGGTTACA TAACAGGTAG 750 

TTTCGGATTT TTCTTTCTTT TGAGTTTTCT TCAATTTGAT ATCATCTTGT 800

TGTGATATAA TATGGCTAAG TTCATTAATT TGGTCAATTT TCAGGTCCTC 850 

TTTCTACTGT CTGGAAAGAG TGGCTTAAGT GGTGTGTTGA GTTTGGTATA 900 

GAAGCAAATG CAATTGTCGC TGTTCCATAC GATTGGAGAT TGTCACCAAC 950 

CAAATTGGAA GAGCGTGACC TTTACTTTCA CAAGCTCAAG TTAGTCCTTA 1000 

TCAGGCTAAT GTCTTTTATC TTCTCTTTTT ATGTAAGATA AGCTAAGAGC 1050 

TCTGGTCGTC TTCCTTTTTG CAGGTTGACC TTTGAAACTG CTTTAAAACT 1100 

CCGTGGCGGC CCTTCTATAG TATTTGCCCA TTCAATGGGT AATAATGTCT 1150 

TCAGATACTT TCTGGAATGG CTGAGGCTAG AAATTGCACC AAAACATTAT 1200 

TTGAAGTGGC TTGATCAGCA TATCCATGCT TATTTCGCTG TTGGTACCGG 1250 

CCTACTATCC TTAAGTTACC ATTTTATTTT TTCTCTAATT GGGGGAGTTA 1300 

TGTTGTGACT TACTGGATTG AGCTCGATAC CTGATTTGTT GTTGATTTAG 1350 

GAGCTCCTCT TCTTGGTTCT GTTGAGGCAA TCAAATCTAC TCTCTCTGGT 1400 

GTAACGTTTG GCCTTCCTGT TTCTGAGGTG ACCTCTGACT TCTCTTTAGT 1450 

TTTAAGTAGT TGATATCAAC CAGGTCTTAT AACTCACTGG ATTTTCCTTT 1500 

TGAAAGTATT ACTTTTGTTA ATTGAACTGC TGTACGCGAT ATGGTATCTG 1550 

TAGATCTTGA AGTGCTAGTT ATCAAAGAAC ATATTGTGGG TAGTATACCT 1600 

GTCAGCGGCC TTAGCTAATA CAACCAAACC ACATGTACAC TGATTTAGTT 1650 

TTCAGATTAT TATGGTAGAC TTTAAGTTGA GAAGAAACTT TGACTGAAAT 1700 

CTTTTTATTT TAATAGGCTA TGATTTGTTT ATTGAAATCA TGTGACATAT 1750 

TGACATGCGC TTCTCATGTT TTTTGTTGGC AAGGCTTCAG GGAACTGCTC 1800 

GGTTGTTGTC CAATTCTTTT GCGTCGTCAT TGTGGCTTAT GCCATTTTCA 1850 

AAGAATTGCA AGGGTGATAA CACATTCTGG ACGCATTTTT CTGGGGGTGC 1900 

TGCAAAGAAA GATAAGCGCG TATACCACTG TGATGAAGAG GAATATCAAT 1950 

CAAAATATTC TGGCTGGCCG ACAAATATTA TTAACATTGA AATTCCTTCC 2000 

ACTAGCGGTT AGACTCTGTA TATGCAACTG TAACACTAAC AAAAGTTTCA 2050 

CCAAGAATGT TCACTCTCAT ATTTCGTTCG TTTGATGTGT ATCCATCAGT 2100 

TACAGAAACA GCTCTAGTCA ACATGACCAG CATGGAATGT GGCCTTCCCA 2150 

CCCTTTTGTC TTTCACAGCC CGTGAACTAG CAGATGGGAC TCTTTTCAAA 2200 

GCAATAGAAG ACTATGACCC AGATAGCAAG AGGATGTTAC ACCAGTTAAA 2250 

GAAGTACGTA CCTTTCTTTG TGATAAGAAA TATTGCTCAT CGATCATCAC 2300 

TTGCTGGCTT CTTGTACGTC AAATTGTTTT GTTTAAATCT CTATATCAAT 2350 

TGTTCATATG CTTTGTCTTT CTTACTATAA GAAACAAGTA TAATCAGAAA 2400

CCTTATTATT GATTATCAGT TCTCTCCTTA TATTATGGAA TGTCTTTTTC 2450

GTTTACAGTT ATGAATGCAA AAGGGGGTAT TTTAGTTGAT TGATTCTCTC 2500 

ATTCTCTAGT TTGTTTTGAC TAATAGCGTC AATTTTGTTT TTCTAGCAAA 2550 

TCTTTGTGAA TTATATATAA CATGCTAACT ATACTTTTCA GGTTGTATCA 2600 

TGATGACCCT GTTTTTAATC CTCTGACTCC TTGGGAGAGA CCACCTATAA 2650 

AAAATGTATT TTGCATATAT GGTGCTCATC TAAAGACAGA GGTATGATGC 2700 

ATTCTCAATA TCACATTATG CGTTGACTTT GTTATTATAT TCCCCATTTG 2750 

GTTTGCAATA TCTTTTTGAA TTATGATTTA TCTTCTCCCT TGCATCTTAT 2800 

GCTATTAAGC GTTAAAGGTA CTAAATGTAT GAAGCTGTCT GTCATAGGTT 2850 






GGTTATTACT TTGCCCCAAG TGGCAAAC 'STGCTGATA ATTGGATCAT 2900
CACGGATATC ATTTATGAAA CTGAAGGTTC CCTCGTGTCA AGGTAATTTT 2950
CCGCAATGGC AGAAGTAAAA CAGGAAGGCA AAGTCTTCTG TATCAGTCTA 3000
GTGGCATGTT ATCTCAGTTG CATAAGCAAA TTATTAAACA ACTAAAATTT 3050
AAGTACTTTT TTATCATTCC TTTTGAGCTT AGTGGATGAT CAGTGGCTTA 3100
AAGTGGGAAG AGGTGTTGCA TGAAACATGA CACTTGTATC AAAGATAACT 3150
AGCAAAACAA AACTAACCCA TTTCTGAATT TCATATTATT AGGAGTAGTC 3200
GTGCTTTTAA AAAATTTGTT TTAAGAAACC GAAAAACTAG TTCATATCTT 3250
GATTGTGCAA TATCTGCAGG TCTGGAACTG TGGTTGATGG GAACGCTGGA 3300
CCTATAACTG GGGATGAGAC GGTAAGCTCA GAAGTTGGTT TTGAAATTAT 3350
CTTCTTGCAA ACTACTGAAG ACTAAGATAA TACTTGCTTC TGGAACACTG 3400
CTTGCTATGT TCTCTAGTAC ACTGCAATAT TGACTCTCCG CTACTTTTAT 3450
TGATTATGAA ATTGATCTCT TATAGGTACC CTATCATTCA CTCTCTTGGT 3500
GCAAGAATTG GCTCGGACCT AAAGTTAACA TAACAATGGC TCCCCAGGTA 3550
CTCTTTTTTA GTTCCTCACC TTATATAGAT CAAACTTTAA GTGTACTTTT 3600
CTGGTTATGT GTTGATTTAC CTCCAATTTG TTCTTTCTAA AAATCATATA 3650
TCTCTGTACT CCTCAAGAAC TTGTATTAAT CTAAACGAGA TTCTCATTGG 3700
GAAAATAAAA CAACAGCCAG AACACGATGG AAGCGACGTA CATGTGGAAC 3750
TAAATGTTGA TCATGAGCAT GGGTCAGACA TCATAGCTAA CATGACAAAA 3800
GCACCAAGGG TTAAGTACAT AACCTTTTAT GAAGACTCTG AGAGCATTCC 3850
GGGGAAGAGA ACCGCAGTCT GGGAGCTTGA TAAAAGTGGG TATTAA 3896

<210> 12

<211> 709 
<212> cDNA 

<213> Lycopersicon esculentum
<400> 12 

CTGGGGCCAA AAGTGAACAT AACAAGGACA CCACAGTCAG AGCATGATGT 50 

TCAGATGTAC AAGTGCATCT AAATATAGAG CATCAACATG GTGAAGATAT 100 

CATTCCCAAT ATGACAAAGT TACCTACAAT GAAGTACATA ACCTATTATG 150 

AGGATTCTGA AAGTTTTCCA GGGACAAGAA CAGCAGTTTG GGAGCTTGAT 200 

AAAGCAAATC ACAGGAACAT TGTCAGATCT CCAGCTTTGA TGCGGGAGCT 250 

GTGGCTTGAG ATGTGGCATG ATATTCATCC TGATAAAAAG TCCAAGTTTG 300 

TTACAAAAGG TGGTGTCTGA TCCTCACTAT TTTCTTCTAT AAATGTTTGA 350 

GTTTGTATTG ACATTGTAAG TATTGCAACA AAAAGCAAAG CGTGGGCCTC 400 

TGAGGGATGA GGACTGCTAT TGGGATTACG GGAAAGCTCG ATGTGCATGG 450 

GCTGAACATT GTGAATACAG GTTAGAATAT TCAAATTATA TTTTGCAAAA 500 

TATTCTCTTT TTGTGTATTT AGGCCACCTT TCCCCGGTCA CAACGATGCA 550 

GATATGTATT CGGGGATGTT CACCTGGGAC AGAGTTGCAG ATTGAAGAGT 600 

TCTACATCTC ACATCCTGTC ACACTATGTG TGATATTTAA GAAACTTTGT 650 

TTGGCGGAAC AACAAGTTTG CACAAACATT TGAAGAAGAA AGCGAAATGA 700 

TTCAGAGAG 709 
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<210> 13 
<211> 623 
<212> PRT 

<213> Schizosaccharomyces pombe
<400> 13 

MASSKKSKTHKKKKFVKSPIDLPNSIOCPTI^SEQPSASETQSVSNKSI^ 

FFFAVGDDNAVFDPATLDKFGNMLGSSDLFDDIKGYLSYNWKDAPFTTDKPSQSPSGISEVQVGI^ 
GYRSDHPVIMVPGVI SSGLESWSFl^CSI PYFRKRLWGSWSMLKAMFLDKQC^n^EHLM^ 1 0 

KLRAAQGFEAADFF I TG YWIWSKVIENLAAIGYEPNNMLS AS YDV^ 

IVHKKKVVTJISHSMGSQWYYFFKWVEAEGYGNGGPTWVNDHIEA^ INI SGSLIGAPKTVAALLSGEMKD3 5 0 
TGIVITLNILEKFFSRSERAMMVRTMGGVSSMLPKGGDVAPDDLNQTNFSNGAIIRYREDIDKDHDEFDI420 

DDALQFLKNVTDDDFKVMXiAiO^SHGLAWTEKEVLKNNEMPSK^ 9 0 

ERGYYYTNNPEGQPVIDSSVOTGTKVENGIV]ynDDGDGTLPILALGLVCNKW 

HEPAAFDLRGGPRSAEKVDILGHSELNEIILKVSSGHGDSVPNRYISDIQEIINEXNLDKPRN 623 



<210> 14
<211> 432
<212> PRT

<213> Arabidopsis thaliana
<400> 14

MKKISSKYSWIAILVVVTMTSMCQAVGSNVYPLILVPGNGGNQLEVRLDr^YK^ 
SGGWFRLWFDAAVT.LSPFTRCFSDR]^yYDPDLDDYQNAPGVQTRVPHFGSTK5LLYIJDPRLra^ 
EHLVKALEKKCGYVNDQTILGAPYDFRYGLAASGHPSRVASQFLQDLKQLVEKTSSENEGKPVILLSHSL2 1 0 
GGLFVLHFLNRTTPSWRRKYIKHFVALAAPWGGTISQl^TFASGNTLGVPLVOTLL^^ 8 0 

LPSTKVFHDRTKPLVVTPQVOTTAYEMDRFFADIGFSQGWPYKTRVLPLTEEU^ 5 0 

TPEVLMYGKGGFDKQPEIKYGDGDGTVNLASLAALKVDSLNTVSIDGVSHTSILKDEIA^^ 
NYELANVNAVNE 432 
<210> 15
<211> 552 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 15 

mgansks'vrras?tviavfflicggrtavedetefhgdysklsgiiipgfastqlrawsildcpytpldfn70 
pldl vvldttklls awcwkcm\n^dpynqtdkpecksrpdsglsaiteldpgyitgplstv^^ 0 
%'^rgieanaivavpydwrlsptkleerdlyfhklkltfetalklrggpsivfahsmgnnvf 
e:a?k>:ylkwi^dqhikayfavgapllgsveaikstlsgvtfglpvsegtarllsnsfasslwi^ 
kgdntfvvthrsggaakkdkrvyhcdeeeyqskysgwptniinieipstsarei^gtl^ 
f^.lhclkjcyv?ffviiwiahrsslagfllyhddpwnpltpwerppik]srvtciy 
k?y?3^^aiitdiiyetegslvsrsgtvvdgnagpitgdetvpyhslswcknwi^ 
kqqfehdgsdvhvelnvdhehgsdiial^tkaprvkyitfyedsesipgkrtavwe^ 552 



<170> PatentIn ver. 2.0



<210> 1
<211> 661
<212> PRT

<213> Saccharomyces cerevisiae

<400> 1
Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660

<210> 2
<211> 387
<212> PRT

<213> Arabidopsis thaliana

<400> 2
Val Gly Ser Asn Val Tyr Pro Leu Ile Leu Val Pro Gly Asn Gly Gly
1 5 10 15

Asn Gln Leu Glu Val Arg Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val
20 25 30

Trp Cys Ser Ser Trp Leu Tyr Pro Ile His Lys Lys Ser Gly Gly Trp
35 40 45

Phe Arg Leu Trp Phe Asp Ala Ala Val Leu Leu Ser Pro Phe Thr Arg
50 55 60

Cys Phe Ser Asp Arg Met Met Leu Tyr Tyr Asp Pro Asp Leu Asp Asp
65 70 75 80

Tyr Gln Asn Ala Pro Gly Val Gln Thr Arg Val Pro His Phe Gly Ser
85 90 95

Thr Lys Ser Leu Leu Tyr Leu Asp Pro Arg Leu Arg Asp Ala Thr Ser
100 105 110

Tyr Met Glu His Leu Val Lys Ala Leu Glu Lys Lys Cys Gly Tyr Val
115 120 125

Asn Asp Gln Thr Ile Leu Gly Ala Pro Tyr Asp Phe Arg Tyr Gly Leu
130 135 140

Ala Ala Ser Gly His Pro Ser Arg Val Ala Ser Gln Phe Leu Gln Asp
145 150 155 160

Leu Lys Gln Leu Val Glu Lys Thr Ser Ser Glu Asn Glu Gly Lys Pro
165 170 175

Val Ile Leu Leu Ser His Ser Leu Gly Gly Leu Phe Val Leu His Phe
180 185 190

Leu Asn Arg Thr Thr Pro Ser Trp Arg Arg Lys Tyr Ile Lys His Phe
195 200 205

Val Ala Leu Ala Ala Pro Trp Gly Gly Thr Ile Ser Gln Met Lys Thr
210 215 220

Phe Ala Ser Gly Asn Thr Leu Gly Val Pro Leu Val Asn Pro Leu Leu
225 230 235 240

Val Arg Arg His Gln Arg Thr Ser Glu Ser Asn Gln Trp Leu Leu Pro
245 250 255

Ser Thr Lys Val Phe His Asp Arg Thr Lys Pro Leu Val Val Thr Pro
260 265 270

Gln Val Asn Tyr Thr Ala Tyr Glu Met Asp Arg Phe Phe Ala Asp Ile
275 280 285

Gly Phe Ser Gln Gly Val Val Pro Tyr Lys Thr Arg Val Leu Pro Leu
290 295 300

Thr Glu Glu Leu Met Thr Pro Gly Val Pro Val Thr Cys Ile Tyr Gly
305 310 315 320

Arg Gly Val Asp Thr Pro Glu Val Leu Met Tyr Gly Lys Gly Gly Phe
325 330 335

Asp Lys Gln Pro Glu Ile Lys Tyr Gly Asp Gly Asp Gly Thr Val Asn
340 345 350

Leu Ala Ser Leu Ala Ala Leu Lys Val Asp Ser Leu Asn Thr Val Glu
355 360 365

Ile Asp Gly Val Ser His Thr Ser Ile Leu Lys Asp Glu Ile Ala Leu
370 375 380

Lys Glu Ile
385



<210> 3
<211> 389
<212> PRT

<213> Arabidopsis thaliana
<400> 3

Leu Lys Lys Glu Gly Leu Lys Ala Lys His Pro Val Val Phe Ile Pro
1 5 10 15

Gly Ile Val Thr Gly Gly Leu Glu Leu Trp Glu Gly Lys Gln Cys Ala
20 25 30

Asp Gly Leu Phe Arg Lys Arg Leu Trp Gly Gly Thr Phe Leu Cys Trp
35 40 45

Val Glu His Met Ser Leu Asp Asn Glu Thr Gly Leu Asp Pro Ala Gly
50 55 60

Ile Arg Val Arg Ala Val Ser Gly Leu Val Ala Ala Asp Tyr Phe Ala
65 70 75 80

Pro Gly Tyr Phe Val Trp Ala Val Leu Ile Ala Asn Leu Ala His Ile
85 90 95

Gly Tyr Glu Glu Lys Asn Met Tyr Met Ala Ala Tyr Asp Trp Arg Leu
100 105 110

Ser Phe Gln Asn Thr Glu Arg Asp Gln Thr Leu Ser Arg Met Lys Ser
115 120 125

Asn Ile Glu Leu Met Val Ser Thr Asn Gly Gly Lys Lys Ala Val Ile
130 135 140

Val Pro His Ser Met Gly Val Leu Tyr Phe Leu His Phe Met Lys Trp
145 150 155 160

Val Glu Ala Pro Ala Pro Leu Gly Gly Gly Gly Gly Pro Asp Trp Cys
165 170 175

Ala Lys Tyr Ile Lys Ala Val Met Asn Ile Gly Gly Pro Phe Leu Gly
180 185 190

Val Pro Lys Ala Val Ala Gly Leu Phe Ser Ala Glu Ala Lys Asp Met
195 200 205

Arg Met Thr Arg Thr Trp Asp Ser Thr Met Ser Met Leu Pro Lys Gly
210 215 220

Gly Asn Thr Ile Trp Gly Gly Leu Asp Trp Ser Pro Glu Leu Pro Asn
225 230 235 240

Ala Pro Glu Met Glu Ile Tyr Ser Leu Tyr Gly Val Gly Ile Pro Thr
245 250 255

Glu Arg Ala Tyr Val Tyr Lys Leu Asn Gln Ser Pro Asp Ser Cys Ile
260 265 270

Pro Phe Gln Ile Phe Thr Ser Ala His Glu Glu Asp Glu Asp Ser Cys
275 280 285

Leu Lys Ala Gly Val Tyr Asn Val Asp Gly Asp Glu Thr Val Pro Val
290 295 300

Leu Ser Ala Gly Tyr Met Cys Ala Lys Ala Trp Arg Gly Lys Thr Arg
305 310 315 320

Phe Asn Pro Ser Gly Ile Lys Thr Tyr Ile Arg Glu Tyr Asn His Ser
325 330 335

Pro Pro Ala Asn Leu Leu Glu Gly Arg Gly Thr Gln Ser Gly Ala His
340 345 350

Val Asp Ile Met Gly Asn Phe Ala Leu Ile Glu Asp Ile Met Arg Val
355 360 365

Ala Ala Gly Gly Asn Gly Ser Asp Ile Gly His Asp Gln Val His Ser
370 375 380

Gly Ile Phe Glu Trp
385



<210> 4
<211> 1986

<212> DNA

<213> Saccharomyces cerevisiae

<220>

<221> CDS

<222> (1)..(1983)



<*00> 4 aac caa aag agt gat tct 

20 ate ggc aca ccc trc cga aga a^. g^^ cag 

Men Gly Tto Leu PHe Arg- Arg Asr. Vc^^ Gin ash 
1 5 

« K I" ^ i IS S5 IS S S 51 S IS S 
^ jr; S SI Its SS ^2 S!5 5? s s s S5 

30 35 *0 

atr agr ggc ag. g=a aaa aga aac gag =gr gg aaa gac g= ga= agg 
lie Ser Gly Ser Ala Lys Arg Asn Glu A-g eiy ^.^^ 
SO 55 

" ^ j„ ..J ^ s is s r.= s 

Lvs Ara Ks-o Gly Asn Gly Arg uys .-ta - - = 80 
S5 ■ " 70 

40 azi: ttc ait cct ggc gca etc eta ggz gta cct ccg ccg etc ag^ 
He PHe He Leu Gly Ala Pbe Leu Gly Val Leu Leu ^_ 

85 ^" 
ggc get tat cat gtt cat aat age gat age gac ttg ttt gac aac ttt 
aiv Ila Tyr His Val His Asn Ser. Asp Se. Asp Le- -n 
100 

^= aaa eta tat tta aat gat tgg aaa gat gtt 

gta aat ttt gat tea e-t aaa gcg v, ^ ^g_T_ 

V-1 AS- »--ie Aso Ser Leu Lys Vax Tyr Leu A»p Asp irp . 
;j_5 - 120 

^ ^- - cag cct get aac 

etc cca =aa ggt ata agt teg ttt ate ga. 
Leu Pro Gj-r. Gly :-ie Ser se. - — - - 

130 

55 _ aar cra& aat ttt gcc gtt ggt 

tac tec aca cct tec tea gac ga. ^-^ ag^ gaa a ^ 
ier Thr Ser Ser Leu Asp Asp .eu Ser Giu As., .ne ^^^^ 

145 

- /-rar- rrc a»« cat cct gtt gta 
60 aaa caa etc tta cgt gat cat aac a-- ga,, ge^ ^^^^ ^^^^ 

Lys Glr. Leu Leu Arg Asp Tyr Aa*. 



96 



144 



192 



45 



50 



288 



336 



3 84 



432 
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165 



170 



atg gtr ccc ggt gtc att cct acg gga act gaa age tgg gga gtt: att 
Sel ?al 5ro Sly Val lie Ser Ti^r Gly He Glu Ser Trp Gly Val He 
180 185 

aaa aac cac aac cgc gat agt tct gcg cat ttt cat aaa egg ccg tgg 
IS IS SJ5 Lp sir ser Ala His Phe Arg Ly| Arg I.eu Trp 



576 



624 



10 



195 ' 200 205 

gga agt ttt tac acg ctg aga aca atg gtt atg gat aaa gtt cgt tgg 
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 215 220 

15 ttg aaa cat gta atg tta gat cc- gaa aca ggt ctg gac cca ccg aac 
Leu Lys Kis Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 240 

ttt acg eta cat goa gca cag ggc ttc gaa tea act gac tac tte ate 768 
20 pL Th? £eu A^c La Ala Gin Gly Phe Glu Ser Thr ?^P Tyr Pne He 

245 250 233 



672 



720 



cca gga cat tgc att tgg aac aaa gtt ttc caa aat ctg gga gta ^c, 
La lly tS tI; lie tIp Asa Lys Val Phe Gin Asn Leu Gly Vax He 
25 ■ 260 265 270 



30 



50 



cgc tat aaa cec aat aaa atg acg agz get gcg tat ga>. ^gg agg ctt 
Glv Tyr C-iu Pro Asn Lvs Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 285 

gca tat tea gat eta gaa aga cgc gat agg tac ttt acg aag cca aag 
Ala Tyr Leu Asr, Leu Glu Arg Arg Asp Arg Tyr Pne Tnr Lys Leu Lys 
290 295 300 



35 gaa caa ate aaa ctg ttt cat caa ttg agt ggt gaa aaa gt-. t«u ttc. 
Glu Gin He Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cy» Leu 
305 310 32.3 



816 



864 



912 



960 



att caa c=c tct atg agt tet cag act ace ccc cac tec acg aaa tgg 1008 
40 li; Ilv kIs Se^ 6ly Ser Gin Xle He Phe Tyr Phe Met Lys Trp 

325 330 333 

gtc gaa gee caa ggc ccc ccc cac ggt aat ggt ggt cgc ggc tgg gtt 1056 
V^l Glu lla Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
45 340 345 3=0 

caa c=c a-= aat tea ttc att aat gea gca ggg acg ctz ctg ggc 1104 
%~\ Ci ser Phe He Asn Ala Ala Gly Thr Leu Leu Gly 

" 353 " 360 353 

^r.. aac ac= a— cca act eta acz agt ggt gaa acg aaa gat ace 1152 

li; !JI ?a£ Iro -lla Leu He Ser Gly Glu Met Lys Asp Thr 

37C 375 330 

, , ^ = ^ -cr- ttc gaa aac tec CCe tea 1200 

a~~ ca= t~» aaz aca tta ccc a-a -a- ra- -"-r a— - 

^- IT" X — , — m- M— -^-LT- £?*v Leu Glu Lvs Pne Phe Ser 

lie Gin Leu Asn Tnr ueu M«- .yr v__y -ie_ = 

385 390 39= 



- "rrcr crcTu cere ccs. 

21 ii; IS L!; ?S Jj: s;? s aiv iiv n. ^ 

405 ^-^ 



124S 
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acg era cca «5 gga gaa gag gtc arc tgg gg jar arg aag rca rcr 
Mer Leu Pro Lys Gly Glu Glu Val ile Trp Gly Asp 

420 "5 
cca gag gar gca rrg aar aac aac acr gac aca ra= ggc aar rrc arr 
Ser Glu Asp Ala Leu Asn Asn Asn Ttir Asp la- '■^ | 



445 



435 



10 cga rrr gaa agg aar acg age gar gcr rrc a^ a 

Ara Phe Glu Arg Asn Thr Ser Asp Ala .ae ash 
450 4=5 

aaa gac gcc arr aac arg aca rra rcg ara rca ccr gaa rgg crc caa 
15 Lvs AST3 Ala lie Asn Mer THr Leu Ser 11- se. -xo 

465 470 

5a Kfs s; IS rj m ^ n? l^^ § is 

20 485 



1296 



1344 



1392 



1440 



1488 



„, «a ccj =a= »= «| =J= ^ S? 

Glu Leu Arg Lys Asn Glu Leu His .^-s uy^ 
500 

gaa gra cca err cca gaa gcr cec cac arg aaa arc rar rgr ara rac 
Glu val Pro Leu Pro Glu Ala Pro His Ms. Lys 11- Tyr cy 
515 

30 ggg grg aac aac cca acr gaa agg gea rar gra rar aag gaa gag gar 
Gly Val Asn Asn Pro Tar Glu Arg Aia lyi v 

L -.=. g=-- -5 -II s= 5?? 1" "I «° 

35 Asp Ser Ser Ala Leu Asn Leu Tbr lie asp 

545 550 55= 

era rrc crc acc gag ggg gac gga ae= grr ccg crc grg geg car rca 
Val Phe Leu Tiir Glu Gly Asp Gly Thr v^l Pro Leu vax 
40 5S5 

arg rgr cac aaa rgg gcc cag ggr gcr rca ccg rac aac cer gcc gga 

Mer CVS Kis Lys Trp Ala Gin Gly Ala se o j.y 

10 583 

= =-rr Paa cac cac cca aar cga rrr gar 1824 
arr aac grr acr arr grg gaa a.g aa. cac =a. 

600 

50 ara cgr ggr gga gca aaa age g== gaa cac gra gac arc crc ggc age 
lis Arc Gly Gly ;^.la Lys ser Al* Glu .-^s va- asp 



1536 



1632 



1680 



1728 



1776 



c^^ 620 

610 



1872 



1920 



60 



t-- aaa act cca age ggt: aac ggc ga^ 
rcg gag zzg aac gac cac a^w - ^ 

i^^a Glu L6U Asr. lisp Tyr Ilb ueu Lys .xe Se. uxy 
525 "0 

=,= ... ==. === =» jj= ^ IL: §S jg 5ii In 

Leu Val Glu Pre ilrc Gir. i-eu se. Asn g55 
o43 
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1986 

atg ccc ttc cca atg taa 
Met: Pro Phe Pro Met 
660 

<210> 5
<211> 661
<212> PRT

<213> Saccharomyces cerevisiae

<400> 5
Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660
SEQUENCE LISTING

<110> £ 1 >rr-r. s Dr., Seen 



<i:o> 

< u > 



<210> 1

<211> 1986

<212> genomic DNA

<213> Saccharomyces cerevisiae

<220>

<221> CDS

<222> (1)..(1983)

::r:;: .^^ - =r r >t: " 

- " ^ Gin >sr. Gir. liVS Ser £cr 

Ksr Gly Th.r I-c\i Phe Ars Xrs As- v^. CX.. as.. y 



5 1° 



cc. ^a. =3= SS'~ cct S-it =a= c«. ^S- 

I« =1-. >^r. Asn Ly= Gly Glr ser V.l Kis Asn Lys ^ors OXu Ser A.. 

«t ca. cas c.a SS= ca^- aas aga .ga ass S5= 

as:. :-• - Zle Kis Y.Ls Gin Gin Gly Leu Gly His I.ys Arg Ar. ^5 .Gly 



35 40 



... .sa aa-. gac cs= o.c aaa sa. tec sac as- 
Scr Ser ?.l. I^ys ^.rs A=n Glu Axe Gly X-y« Ph. A.p ... 

30 ' SS " . 

aa. as. sa= SSS aac asa aaa cs- "S^ J- «= 

Ly.. ?.rs A3? Gly Asn Gly Krs V/= Arg Tr? ^rs 



3.92 



240 



Ss 70 



... a-.. SH- .ta SS- - - - 5 c=. c.t asc 2Sa 

Pn= I- -.a. Gly XVa Pne I.eu Gly v.l Leu Leu Pro 



. ^ aa- asrc car age sac 

1!: ;i; ^ ;i; 

2.00 



33 = 
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aa= c.t c« .a. s"-. t.s 9^= S-^- 

"v.l X.n V^. Scr I,eu I.y3 Val Tyr I.eu A.p ^5? Trp X-ya .... V.l 



lis 



120 



-c- etc acc crac ga::. ca^g gcr ggt aa= 432 

c« cc^. cac 5== -S^ ^" ^ - ' C,.,^ 

Lcu Pro Gin Gly lie S*r Siir Pm He ^.s? as.- 



ctrc cc?. caa ggr 2.13. a 

Siir P'n-a Xic ^^^r' 

130 

.„ .... «c c.. - - III 

Tyr ser Tlvr Ssr S*r l.au Xs? k 

143 ISO 

a... cca ==. gac cac === a.a «c ... ^u. ju. 

..y= GU. I.eu ^sp ryr Asr. ri. Glu ..1. I.y= -is P- V.X 

iss -*^° 

c— 2.= - =aa a== -55 S5* 

j.=g o== c=r eg= etc ett w=c S,- - 

K*-. val Dr= Gly val lie S^r Tivr Gly .le Glv — .rs 

»nr gr.g ca= =c=. =3^ ==^5T egg Sa* 

Giv ASP AS? Glu cy= ASS ser Ser f^a --.is -h- i.y^ 
193 

e-S stts =at aa.s a-- "S- "== '"^^ 
SS. as^ cac ate ctg ^.ja ac. a.^ g..- ^ 

Giy ser ?i.e T-yr Met Leu Tnr Met Vai K=. A^.- -/= 



57c 



720 



.10 

t.. .a. sat c=t sa. ^ac =ca ccg aa= 

-uy. Kl. V.1 H.-_ X^eu Asp .-r= Glu TT.r G.y I.eu -o - ^- 

22= 230 

gca ca. SS-: - .aa c=. a=- .a. t^ 7.. 

, sat Ct- "2- ^" £ls 

---- r:: III T.: L" iiy v.. 

Ala Gly Tyr Tr? j-le T_- A--^ -y- v— 
2S0 " = 

, === iat SS.- === -=S 2 = -- 

sr= ::at c== aar c-as- 2--= -== 

mw_- c- Ala Ala T>-r As? -~ --s -s- 

•Slv Tyr Glu ?r= Asr. Lyr- Ms. — ----- 

275 . 
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.ca cac «^ gat cca gaa aga cgc c.-. =.SS cct ^cs eta «s 912 

Z: Z II U .eu Clu .rs .rs Xsp T.c Pne T.r .v. I-cu X.y. 
290 29S 

«L-c caf ct= tts cac caa ccg as': S5= S*^ S-^" '^'^^ 

gas caa a^c c-*-- <=-=. "="— Vai Cys Leu. 

Glu Gin lie Glu Leu Phe Kla Gin Leu Sar Gly Glu Ly- y 

303 3" 

gs. cat .cc a., gg. cct cag act ac= tct ta= .c. aaa .g. 
SI- Glv Kis Ser Met Gly Sar Gin lie ^le 'r/'^ 



xaoa 



325 330 



gcc gag gc. gaa ggc cc. ct. ta= gat aa. ggc gg. cg^- 
vai Glu Ala Glu Gly Pre Leu Tyr Gly A.»n Gly Gly A.. ^^.^ . 



3^0 34S 



.ac g.. ca= a.a gac tea .=c aa. gca gca ggg acg ccg jgc 

Asa Glu Hie lie Ser Phe He Aer. ^la Al^ Ciy T ^ 
33= 

.... cc» «c ..c - 

Ala Pro Ly= Al^ Pro ^^1?- Leu lie se. o.y 
370 375 



lOStl 



1X04 



^ ^ ^ -,t^ c&a aag tec ttc cca X200 

at. eaa eta aae aeg .ca gee .eg ^ S.^ S . ^.^^ 

He Gin Lau Asn Tiir Leu XLa Me. Tyr ci. ^-^ 



3SS 



— f^— t coc e.ce cca tea 
aga a.t gag aga g.a aaa at:g Cta caa aeg t« g,. 5 
xre Zle Glu Arg val Ly3 Mac Leu Gin Tfer T..- Gly Cly 



405 -^^^ 



aeg eea eea aag gga gaa gag gee a« egg ggg 5- 
M*t Leu Pro Lys Gly Glu Glu Val Xl« .rp Gly H- 



1296 



-.20 «^ 



-^.^ a^* tac cgc aan ttc act 

tea gag gae gea eeg aat aac aae aet ga a . -3 
S«r Glu AS? ;aa Leu Xan A=r- Asr. Thr Xsp Trv- Tyr Q^y 

aae aaa s-ai teg aea aeg 

ega .tr eaa agg aa= acg age =a. ^^.^ _ 
Arg Phe Glu Arg xsn Tiir Ser «s.- 
450 <^- = 

cct caa err etc csa 

aaz cac s== "= !== Z'.l p^o Glu Tr? l:eu GLn 
lys Aae Ala Xle As= Mer Tnr Leu Sar .la 



12 ^^ 



13S2 



144 0 



470 ^'^^ 
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r.cTSL Rcc era csr gac cag tac tee ttc esc cac ccc aag sac eaa caa 143G 
Xrg Arc Ve.l His Giu Gin Tyr Ser Phs Gly Tyr £sr Lys ?.sn Gl\i Glu 
425 4SC 455 

gac cc;. ?.ga a=.£ aac gs.g eta cac car aac cac tgg ccg aat cca acg 1S3S 
Glu Leu Arc l.ys Xar. Glu Leu Kis Kis Lvh His Trp Ser Asr. Pro Met 
£00 SOS -2.0 

cia era ces ctt cca ga?. get ccc cac ate aaa ate tat t=t ata tac 1524 
Glu Val Pre I,su 3rG Glu Ala Fro Kis Hst Lys lie Tyr Cys lie Tyr 
£-^5 520 S2S 

gg= ccg aac ^-.ac cca act gaa. age gca rat gta rat aag gaa gag gat 1622 
Gly Val Xrr. Aar. Prro TKr Glu Arg Ala Tyr Val Tyr I^ys Glu Glu Asp 
S30 S35 S40 

gar tec ret gcr ctg aac Ctg cvcc ate gac cac gaa ace aao ca?. ce: IffSC 
Aeo scr £sr Ala I.eu Asn Leu Thr Zls As? Tyr Glu Sar Lys Gin Pre 
C.1C 530 SS5 SSC 

gta etc etc acc gag ggg gac gga ace gtt ccg etc gtg geg cat tea 
val Phe Leu Thr Glu Gly Asp Gly Tiir Val Pro Lau Val Ala His Ser 
ses S10 57S 

at9 tc;: cae £=a egg gee eag ggt get cca ccg tac aac cct gcc gga 
H^t cys Kie Lys Trp Ala Gin Gly Ala Sar Pro Tyr Asn Pro ^.JLa Gly 
530 " BcE £50 

att aac gtt act ate gt= gaa ELtg aaa car cag cca gat cga ctt gat l£-24 
Tie Arn Val Trir lie Val Glu Mat Lyr His Gin Pro Asp Arg Phe Asp 
= S3 600 



172.8 



177e 



«a cgt ggc gga gca aaa age gcc gaa cac gta gac ate etc gee age 
lie Arg Gly Gly Ala Lys Ser v?-la Glu His Val Asp lie Leu Gly Ssr 
e-LO 61S »20 

gcc cag ttg aac gat tac ate t:tg aaa att gea age ggt aat ggc cat 
Ala Glu Leu ;^^n As? Tyr lis Leu Lys Zle Ala Scr Gly Asr. Gly Asp 
«5 530 S2S 540 

etc gcc gag era cgc caa ttg ret aat tee age cac tgg gtt t:ct cag 
Leu. Val Glu Pre Arg Gin Leu Scr Asr. Leu Ser Glr* Trp Val Ser Glr. 

S45 €50 iSSs 



1B72 



1520 



ISSe 
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1985 

z.ts ccc ttc cca atg taa 
Mac Pro Pha Pre Met 

6qC 
<211> 661
<212> PRT

<213> Saccharomyces cerevisiae



=1^ r... Ph. xr, X.. V.1 =in =lr. t,y= »r ••..=: 

ci» «n Lvt =iy "y V.1 .r. -y. =J» 

„n Ki= X.. Z HX= ... =1. "V ."u Hi. .X. "V 

X.C =" s„ .y. "° To 

Lvs ..re A« Gly Asr. Oly ^V' ^^^^ " go 

lie PH= lie I.eu Gly ?.la Ph* l.eu Gly v.l L 

.X. ^= vlt H» S.. «P S« 

v.. Ill S.. ... ^ V.X ^ ..u MP «P ^ 

, II, MP MP xi= 

Leu sr-o Glr. Oly Il« so. 

: ssr Slu x=r. ?he Xla vai Gly 

Tyr Ssr Thr Ser sar Leu As? As? I-et. S- 

, t!" ^- >3r. Xla Gl^i il'' ^-^^ 
Lys Glr. Le- Leu ^-5 As? -Ty- as.. - .^^^ 

^"^^ « n-v- CW lie Clu E*r Tr? Gly Va.1 Xle 

wet V*l P^o Gly val He Ser ai-.r Gly 1^- ^- 

s- «e- Kis Phe Arc Lye Are I.cu Tr? 

Gly AS? Asp Glu cys As? se- _e- 

t!! y.- VS.1 Mst AS? Lys Val cys Tr? 
Gly ser Pne Tyr Me:i Leu Arc T>- M— 

G^u -rlir Gly Leu Ac? ?re Pr= Miv 
Leu Lys Ki* Val Met Leu Asp Pro G-u 

* !r C- GW «he Glu ier T« AS? Tyr Pl^e Xle 

PT.S Tiir Leu Ars Ala Ala Glr. G.y -— 

^'-^ TVS Va^ '^e Glr. As:: Leu Gly Val Xle 

.Ms. Gly l-.^r Tr? Xle Tr? Kst. Lys v.. .n 

^" ser Ala ?ie T-.-r As? Tr? Arc Leu 

Gly Tyr Glu Pre ?.s--- Lys Me- — - - 

Al. T,.r Leu A=? Leu Glu Ars ^^r^ X"- ~= -/- J'^; 

i50 , . ,. e— c-iv Giu Lys Vsl C^r'S S^^'-i 

C-lu Glr. Xle Glu Leu Ph. Si. dr. S- 



30S 
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TW Tie «hc Tvr Sive Kcc Ly= Tr? 
lie, Glv His Scr Met Gly 3sr Gin . rvc 

3 3 0 

v.. «u =" ^» -V. o.y ... =iy o^v =^^V -r. v,X 

M„ =w «i. xi: x-^ - 

'„ Ti- s-r Gly Glu wea l-ys 
Ala pro Lvs Xla Val ?ra Ala 1-su Ix- Q y 

!n! M*^ TV- Gly Leu Glu Lya Phc Phe Ser 
He cm teu Asn Thr Leu Ala Mec T>. Ciy 

Xlc Glu V.1 .K. civ Xle ..o se. 

Hec L.U Lvs Tit Glu Glu v.l Xle U 01. A.p M.c Ly. Ser Ser 

. - ».n ABP Thr Tyr Gly Asn Pbe 11= 

Ser Glu Asp Ala. L«u Asn Asr. Rsn T... Asp * 

''^^ V° 1.1. -ohe Asr. Lys Asn Leu Thr Kftt 

Xrs ?hc Glu Arg Asn Thr S*r Asp Al* ?he A-., ^y^ 

"^^^ , Ti- se- pro Glu Tz-P Lau. Gin 

V/s ASP Ala He Asr. Mez Thr Leu Scr He Se. 

'''''' . »vv* GW Tyr ser Lys Asn Glu Glu 

Are Are V*l Kis Glu Gin Tyr Ser Phe G.y Ty- x 

G.u X.-U Hi- Ki= Lye Kis Trp ser Asr. Pro Met 
Glu Lsu Arg Z-vs Asn Glu Lsu r.i- — ^ t 

5 OH 

- r 

s„ ... - X., 

^'^^ . CI" -H- Val pro Leu VaLl Al:^ Kis ser 

Va.1 ?he Leu Thr Glu Gly Asp Gl^ -h_ vai 

- C-1-. Glv Ala ler Pro T>-r Asn Pro Ala Gly 
Ket cys Kis Lys Trp Al- C-l.. ggo 

, „ - M-t Lva Kis Gin Pro Asp Arg Phe Asp 

He As-, val Thr Zi* Vai «lu K-t L^a 

^ "° -3.^ KiB val ASP He Leu Gly Ser 

21s Arc Gly Gly Al- Lyr Sar Ala Clu k 

£10 -i. kTa c-^ Gly Asr. Gly AS? 

^la C-lu Leu >.sr. A." Tyr Xle Leu Lys .le Al. — 

"! T.. =- Asn Leu ser C-lr. Trp Val Gin 
Leu Vai Glu Pro Ar= Glr. Leu S— a-.- 



M*»- ?rc Pii* ?-o Ms* 
(SSO 
<210> 3

<211> 2312

<212> genomic DNA

<213> Schizosaccharomyces pombe
<400> 3

7>.TGGCGTCTT CCAAGAACfvG CA?JSl?wCTCAT AAG^-^.J^J^AG^. J^GAAGTCAA SO 
XTCTCCTATC ^.TTCi^JSAGAA. ?vCCAA.CTCCC GCTTTGAGTG 100 

AGCXXCCTTC AGCGTCCGAA ACACj:^-^TCTG TTTCAAATAA ATOV^.GA^^. 15 C 

rCTAAATTTG GAAAAAGATT GAATTTTATA TTGGGCGCTA TTTTGGGAAT 200 
ATCCGGTGCT TTTTTTTTCG CTGTT GGAGA CGACAATGCT GTTTTCGACC 25 0 

• CTGCT ACGTT AGATAAA—-!.* GGGAATATGC rAGGCTCTTC AGACTTGTTT 3 00 

GATCACATTA AAGGATATTT ATCTTATAAT GTGTTTAAGG ATGCACGTrT 33 0 

TACTACGGAC AJ^CCTTCGC AGTCTCCTAG CGGAAATGAA GTTCAAGTTG 
GTCTTGATA.T GTACAATGAG GGATAXCGAA GTGACCATCC TGTTATTATG 
GTTCCTGGTG- TTATCAGCTC AGGATTAGAA AGTTGGTCGT TTAATAATTG SOC 
CTCG.ATTCCT TACTTTAGGA AACGTCTTTG GGGTAGCTGG TCTATGCTGA 55 0 

AGGCAATGTT CCTTGACAAG CAATGCTGGC TTGAACATTT AATGCTT GAT 
AAAAAJACCG GCTTGGATCC G.-J^GGGAATT J^J^CIGCGI^G CAGCTCAGGG 
GTTTGAAGCA GCTGAT'ITTT TTATCACGGG CTATTGGAIT TGGAGT?AAG 
TAATTGAAiXA CCTTGCTGC^. ATTGGTTATG AGCCTAATAA CATGTTAAGT 
GCTTCTTACG ATTGGCGGTT ATCATATGCA AATTTAGAGG AACGTGATA--. 
ATATTTTTCA AAGTTAAAA?, TGTTCATTGA GTACAGCA;i.C ATTGTACATA S3 0 

AGAAAAAGGT AGTCTTGATT TCTCACTCCA TGGGTTCACA GGTTACGTAC 900 
-TATrTTTTTA AGTGGGTTGA ACiLTrCiAGCGC TACGGAAATG GTGGACCQAC OSQ 
TTGGGTTAAT GATCATATTG AAGCATTTAT AAATGTGAGT CTCGATGGTr XOOO 
GTTrGACTA.C GTTTCTAj^.CT TTTGAATAGA TATCGGGATC TTTGATrGGA ICSO 
CCACCCAja;AA CAGTGGCAGC GCTTTTATCG GGTGAAATGA AAGATACAGG 
TAT-rGTAATT AC7vTTAAACA TGTTAA.TA'r: T^XTTTTTGC TAACCGTTTT 
AT^GCTCAXTT GAATCAGTTT TCGGTCTA-TG GGTAAGCAAT AXXTTGTTGA 

GATrniTTi%c TAr-rrrACTG tttagtttgg aaaaattttt ttccccttct 

GMGTATA.TT CAAAAATACA A:\TGTGCTCT ACTTTTTCTA ACrTTTAATA 
GAGAGCC^LTG ATC-GTTCGCA CTA.TGGG?wGG AGTTAGTTCT XTGCTTCCW- 135 0 

AXSGAGGCSX TGrrSTATGG GGAAATGCCX * GTTGGGXAAG >JUi.TA.rGTqC 140 0 
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€00 
c50 
700 
750 
£00 



1100 
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1200 
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TSTTAATrrr ttatt^atat ttaggctcca gxtg^-ctta xtcaaacaaa 

TTTTTCCAAT. GCnOCr^.TTC^ TTCG^TATAG ^.G^GACAT? G>.T^GG;.CC 
ACGXTGAATT TGACATAGAT Gi^TGCATT^C AATTTTTAX^ AAATOTTXCA 
G;.TGACGATr TTA^^GTCXT GCT*:=CGAAX AATT^TTCCC ACGGTCTTGC 
TrGG;.CTG^.X ^.Z..^T^ T,.^A?^TAA CGA^TGCCG TCT^^TGGA 

TA,-,^TCCGcr xga;^taaoX xc;.rrAA.^GT t^ctaaatta tactaaccca 

AATAGACTAG TCTTCrTTAT GCTCCTGAT;. TGAAAATTTA TTSCCTTCAC 
^-^3^^-..^ AACC^ACTGA GAGAGGTTAT TArTATACXA XTA.^TCCTGA 
Ga.GC._XC=T GTCATTGATT CCTCGGTTAA TGATGGAACA AAAGTTGAAA 
ATGTGAGAGA ATTTATC-TTT CAA^XTTCT ATTAACTGTT rTATTAGGCT 
A-.TGTTATGG ATGArCGTGA TGGAACTTTA CC;^>.TATTAG CCCrTGGTTT 

GGTCTCCJ^T a;....gtttggc >aacaa;^^-».o gt^a;'.tcct gctaatacaa 

GTATCACA^ TTATGAAATC AAGCATGAAC CTGCTGCGTT. T=AT=TOACA 
GGAGCACCTC CCTCGGCAGA ACACG^CGAr ATACTTG3XC ATTCAGACCT 
AAATGTATGT TCATT-TTXCC TTA=AA:.TTZ CTATTACTAA CTCTTGAAAT 
AAGOAAATTA TTTTAAAAX.T TTCATCAGGC CXTCGTGACT C^CTT^C^ 
CCGTTATATA rCAGATATCC AGTAC3GACA TA^CT«TGT AGATTGCAAt 
TAACTAACTA ACCGAACASC- GAA.-.T>ATAA ATGAGATAAA TCTCGATA^A 



1450 
ISQO 
1530 
IfiOO 

1700 

1750 

ISOO 

ISSO 

1900 

1550 

2000 

2050 

2100 

2150 

2200 

2250 

2300 

2312 
<210> 4

c2Xl.> 358S 

<212> genomic DNA

<213> Arabidopsis thaliana

<400> 4 

.^"CGACG GAGAAACCW CGACGCCGCG 50 

ISO 



r.TCTGW.GAG GTGGTGCACG ATGAfi-XTT. C 

■^,ATCGAACG GAGGWSGGAA GTG^.CGxG. 

-TTCCAAATC CCACCAT?AG AAATCGAA-O 

Taji, cCTGGTGGTi 

^^-T- CATTGGGTGT GTGTGT-TA-. CC^ 

cx.crrc.rT ^^^^^ 

..XTCTCTTC «...=-CO «.TC.=« 

— ;^ _^„c. 

^^XC „^-^-.0^ .C^TCC-T. 

cc^crr^o c.c^« 
ncc^c^T .=TC«rc« «^ ^^^^ ^^^^^ 

^"t^. L=.^c= «cc«c=« 

- 7^^^, 

^„cc„ .=c.„-..«^ — 

..^.ccc.. 

„===cx«. =.ccc«=.r x.c-^ 

, _^-rc»C COTOTAATQA MTMT-CT- 
C^C-«^ 

>,.»^cc-r -rTC««« c«cT=«^ «=rTtT= 

. c=««x«. "'^ 

.^rrr-^ ■7:_^„ ,„,„c.cc cr=«..cc. 

,.x^TT«« — ^^-^ xx««=..= - 
^ r=K=T=«x cTT=>.T=Trr Tt--"- 

Z.^>^ 
"^'"ii: ;.„-^„c =x.«.=.- 



200 
250 
300 
350
400
450
500
550
600
650
700
750
800



850

900

950

1000

1050

1100

1150

1200

1250

1300

1350
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1400

1450

1500

1550

1600

1650

1700

1750

1800

1850

1900

1950



•CCXCG^CrCO TGGCTCCTGA CT^CTTTGCT CCTGGCT..CT TTCTC7==SC 
;^TC=TG^TT CCTA^CCrXG C;«:««TGG AT^TG^GAG ^^TATGT 
AC^TGGCTGC ATATGACTGG CGGCTTTCGT TTCAGAACAC AGAGGTTCTT 
TTCrCATCGT TCTTTCTATT ATTCTGTXCC ATGTrACGTT TCTTTCTTCA 
TTACTTAAGG CT^A.^..TATO Tl^CATGTTG AATTAXTXGG TACGTGATCA 
OACrCXXAGC CGTA.G..AX CTAAXA.AGX GTTGA^CG^ TCTACCAACG 

--TTCCGCATT CCXTGGGGGT CTTGTATTrr 

GTGS/\?>AAAA AGCAGTrATA GTTCCos-Ai - 

f— rCCTCTGG GTGGCGGGGG 

CTACATTTTA TGAAGTGGGT TGAGGCACw^ Gw.CCxti 

i.rxA.TATrAA GGCGGTGATG AACATTGCTtS 
• TGGGCCA3AT TGGTGTGCAA AGTATrvTTAA 

CACCATrrcr tggtgttcca aa^ctgtto cagggctttt ctctgctga.. 

GCAAAGGATG TTGCAGTTGC CAGGTAT-rGA ATATCTGCTX ATACTTTTGA 
^.XCACAAC CTTGGCTCTG GAACTCAA.^C TTATTCTACT AAATATCAAT 
XCXAA.AACA ^OCXA.A.. AXCGCGGAA CGACAXXGG .XGAX.AXX. -00 
^OCXGC^A XGX;^=XGAA ACXCXCTXOA CAXXAGACAA AXGATGAAXX 
.CGCAXXGCX CXGXGAXGAC.C^TXTCXXA GCXXCGACGA 
r^-^^r-r-T T-TGGAGGGC AXXGAAXXTX GCXAXGGAAA 
==,=;^C ^CO...^ .=.TXC.«. ==«TT«» 
XCTTTC^XT XTC^^T^X.. .=C.T=..T= .CTTCTT^T 

rCTA-^TA GTCACTXGGX XCCTTAA.XAT 
XASAGACICG XXGGAXTAGT XoCXCXA-TA ^ 

_ ^r-xTTGCCCC AGGATTCXXA 

MixcTiTM: •rtrcrraw^ m^ttocasa- co-ttgc- 
<=«^==«;. T.T«X«=r ««xc=Tr= c;u=a.T=T^ t==^.^t=« 
^c^..o =.cx=«=- «.cr.T=« 

„„,„^--.e- JlAGGCCAflAC CXGXXGTGGG 
TAXGGGGCGG GCXXGATXCG TCACCG«AG-. A-^GGww 

....'-1 j^-T-^GTGSX GAAGCAGGXG AAAACGGAGX 
Ar-.5.AAGCA>.A AGAACAACGA AA-x.w.TC«i 

TTCC«=^ «T.T«;«= a^».T«=T TTX===«« 

;^.«=^.. C=C,==«=X X=T«=^«A «««TXa* TTTTC^OT^ 
«.TC.T«.T. «TrX=T=« T«.«.«. 

c=.v.,=«. c^r.^^ — ^=^"=-: 

«=^O^T= «^--T«== CTC^TXT^ 
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CT.XKO^TC ™.X.. ...X^ CCCXCC.CCC 

CCVCTTTCX TCCTC.Xa.T XTGOXTC^. CCA^T.XC 

^ CAA ATCCGTTAGA OXCMCTPA GTGATTTCrr 

O^TT=««T TCCTO.T^C •rT«=«TCT TTTTOrTTTC 

coTirrrcrrTO cat«==ttt ««=ctc^ cctta=«m ct=t^=t=^ 

OCCTTTCTOA «^==«= =TC«T«.X. TT=.==XC=. »X=..<=«A 

=;.«T=^crc TT==m:«« *TCCT=ca« t==«-xtc t==tt^« 

0^^««^ «Cr==T<=>. XTOO^XXCT- «T=.^«. C==.=TC=.= 
„«C«XC= «^^=..- =«"-^ =T=CC=XC« 

• xTC=«c=cc TTXca^TXT .cuTrTcrac Ta.c=«==« a.==«=M:. 

=Ct=T=T=^ ^=«==«" T.=*.«T=<= A«C=CXT» «C«T-=C 

^^r-Tt-CAAAA GCGTGSCGTG GCAAGAOAG 
GTCCTAAGTG CCGGGTACAT STGTGCAAAA GwG 

J.GA'-rrATAT AAGAGAATAC AATCACTCTC 
"c-lcC^CX,^ CCT.^-- -^-^'^^^^^ ^^^^'^"^ 

^-^-.^G*TA TAGGXCATGA CCAG«^ww=^.C 
CGGAGGTAXC GGGTCTG^T-rx i/^^ 

•-r-r G^CCTGAAGC TGTC3A 



2900 

2950 

3000 

3050

3100

3150

3200

3250

3300

3350

3400

3450

3500

3550

3600

3650

3685
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<210> 5
<211> 402
<212> cDNA

<213> Arabidopsis thaliana

<220>

<221> CDS

<222> (120)..(402)

A=AXACXGCTCTTrGTCTCT CTCGXCTGATCTAXCAATCC CTAATCTGTGTTCTAAATTC 

CTGCXCaAGATTTG^Ci.-^G TCCGT?.TAGCTTAACCTGGT TTAXTTTCW^TGAC;.GAT 

ATC CCC C-rr C^- CGG AAA. ^-AG CCG ACG CAG AAA CCA TCG ACG CCG 

Jit III IVu Ss I^y^ ^^o Xhr Olu I,y« pro scr Thr Pre 

St: IS I- ^ IT. ^^^^"^ JiS S?5 

-~ ^nr- T-rr P.-r^ GGG TGT GTG TGT GTA ACC 
TCG TCC ATC GXT TCT TGT TGT TGG TTC A.. G-J T 

Scr Cyc lie J^-sp Ser eye Cys Tzrp Pne i.c v^xy 

^r'R ?vTC GCG AGC TTC CCT 

TGG TGG TTT CTT CTC TTC CTT TAC A;^C GCA ATG ^ 



?S S? IS Si a;; 

CAC TAT CTA AGO =AC === AAT CAC GHO TOC TTT CCC TTK CCC C 

Gin Tyr Vil Thr Glu Pro Asrx His ??? Ser Pne i^-a. 



60 

119 

167

215

263

311

359

402 
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<210> 6

<211> 643

<212> cDNA

<213> Zea mays

<220>

<221> CDS

<222> (1) . • <4Q2) 

<400> 6

CGG GAG AA.^. ATA GCT GCT TTG AAG GGG GGT GTT TAG TTA GCC GAT GGT 48
Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1               5                   10                  15

GAT GA.a. ACT GTT CCA GTT CTT AGT GCC GGC TAC ATG TGT GCG JJ^A GGA 96
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
            20                  25                  30

TOG CGT GGC A>-A ACT CCT TTC AGC CCT GCC GGC AGC AAG ACT TAC GTG 144
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
        35                  40                  45

AGA GA?^. TAC AGC CAT TCG CC^. CCC TCT ACT CTC CTG GAA GGC AGG GGC 192
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
    50                  55                  60

Arc CAG AGC GGT GCA CAT GTT GAT ATA ATG GGG A^C TTT GCT CTA A.TT 240
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65                  70                  75                  80

GAG GAC GT*-^ ATC AGA ATA GCT GCT GGG GCA ACC GGT GAG GAJ^. A.TT GGT 288
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
                85                  90                  95

GGC GAT CAG CITT TAT TCA GAT ATA TTC AAG TGG TCi^. GAG AV. ATC AA.i^. 336
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
            100                 105                 110

TTC AA.A TTG TAA CCTATGGGAA GTTAAAGAAG TGCCGACCCG 383
Leu Lys Leu
        115



AAAGTGTCCT GCCTGAGTGC ?^J^CTCTGGAT TTTGCTTAA^. TATTGTAATT TTTCACGC 443

TTCATTCGTC CC'ITTGTCAA ATTTACATTT GACAGGACGC CAATGCGATA CGATGTTG 503

TACCGCTATT TTCAGCATTG TATATTAAAC TGTACAGGTG TAAGTTGCAT TTGCCAGC 563

TGAAATTGTG TAGTCGTTTT CTTTACGATT TAATi^^CAAG TGGCG<=AGCA GTGCCCCA 623



643 
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<211> 115

<212> PRT

<400> 7



Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1               5                   10                  15

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
            20                  25                  30

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
        35                  40                  45

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
    50                  55                  60

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65                  70                  75                  80

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
                85                  90                  95

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
            100                 105                 110

Leu Lys Leu
        115
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<210> 8

<211> 616
<212> cDNA

<213> Neurospora crassa
<400> 8

GGTCCCGXXG /LCCAilGCSCGG AAGTTGGAGG CTAACGAGAA TGACNCTCGG 50
AGXTGG^wTCT XCCCTCTAGA GACACGACTA CCNTTGCACC CAGCCTCAAG 100
GT^^rA^GTT TirrAXGCGT^ ggaagccgac GGAGCGAGCC TACATCXATC 150
TGGCGCCCGA TCCCGCGJUTG ACAACGCATC TTTAGATGAC GATCGATACG 200
XCTTTG/vCTK AGGGGCACAT TGACCACGGT GTGATTTTGG GCGAAGGCGA 250
TGGCACAGTG AACCTTXTGA GTTTGGGGTA CCTGTGCAAT AAGGGGTGOA 300
AAATGAACAG ATACAATCCT GCGGGCTCAA AAATAACCGT GGTCGAGATG 350
CCGCATGAAC CAGA/v^CCGTT CAATCCGAGA GGAGGGCCGA ATACGGCGGA 500
CTTAA.*rATG TAG;i-AAAGGT TGAAATTTAT GAAGAGTAAT TAAXTACGGC 550
ACATAGGTTA CTCAMAGTA TG^-CTAATTA ;lJiJI!JsAAAATT TTTTTTCTAA 600
AATcAAAA A ^A AAAAAA
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<210> 9

<211> 1562

<212> genomic DNA

<213> Arabidopsis thaliana

<400> 9 

ATGjy>J\AAAA T7wrCTTCACA TTATTCGGTA GTCATAGCGA TACTCGTTGt,' 50

GGTGACGATG ACCTCGATGT GTCAAGCTGT GGGTAGC^AC GTGTACCCTT 100

TGATTCl-GGT TCCAGGAAAC GGAGGTAACC AGCTAGAGGT ACGGCTGGAC 150

AGAGA.ATACA AGCCAAGTAG TGTCTGGTGT AGCAGCTGGT TATATCCGAT 200

TCATAAGAAG AGTGGTGGAT GGTTTAGGCT XTGGTTCGAT GCAGCAGTGT 250

TATTGTCTCC CTTCACCAGG TGCTTCAGCG A.TCGAATGAT GTTGTACTAT 300

GACCCrrRaTT TGG&TCATTA CCAAAATGCT CCTGGTGTCC AAACCCGOG-r 350

TCCTCATTTC GGTTCGACCA AATCACTTCT ATACCTCGAC CCTCGTCTCC 400

GGTTAGTACT TTCCAAGATA TATGATTTTG GGACATTTGC ATAATGAACA 450

J^AATAGACAT AAATTTGC:GG GATTATTGTT ATATCAP.TAT CCATTTATAT 500

GCTA6TCGGT AATGTGAGTG TTATGTTAGT ATAGTTAATG TGAGTC-TTA.T 550

GTGArTTTCC A:rTTTAAXTG AAGCTAGAAA GTTGTCGTTT AATAATGTTG 600

CTATGTCATG AaAATTATAA GGACACTATG TAAATGTAGC TTAATAATAA 650

GGTTTGATTT GCAGAGATGC CACA.XCTTAC ATGGAACATT TGGTCiLSACC 700

TCTAGAGAAJv AAATGCGC<3T ATGTTAACGA CCAAACCATC CTAC-GAQCTC 750

CATATGATTT CAGGTACGGC CrGGCTGCTT' CGGGCCACCC C-TCCCGTGTA 800

GCCTCACAGT TCCTACAAGA CCTCAAACAA. TTGGTGGAAA AAACTAGCAG 850

CG»X;AACGA.A GGAAAGCCAG TGATACTCCT CTCCCATAGC CTAGGAGGAC 900

TTTTCGTCCT CCATTTCCTC AACCGT.ACCT- CCCCTTCATG GCGCCGCAAG 950

TACAT-CAAAC ACTTTGTTGC ACTCGCTGCG CCATGC-GGTG GGACGATCTC 1000

TCAGATGAAQ ACATTTGCTT CTGGCAACAC ACTCGGTGTC CCTTTAGTTA 1050

ACCCTTTGCT GGTCAGACGG CATCAGAGGA CCTCCGAGAG TAAGCAATGG 1100

CTACTTCCAT CTACCA.^J^GT GTTTCA.CGAC AGAACTAAAC CGCTTGTCGT 1150

AACTCCCCAG GTTAACTACJ:- CAGCTTACSA GATGGATCC^ TTTTTTGCAG 1200

ACJ>.TTGGATT CTCACAAGGA GTTGTGCCT7 AC^AGAC^AG AQTGTTGCCT 1250
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TTAACAGAGG AGCTGATGAC TCCGGGAGTG CCAGTCACTT GCATATATGG 
G.i;3AGGAGTT GATACACCGG AGGTTTTGAT GTATGGAAAA GGAGGATTCG 
ATAAGCAACC AGAGATTAAG TATGGAGATG GAGATGGGAC GGTTAATTTG 
GCGAGCTTAG CAGCTTTGAA AGTCGATAGC TTGAACACCG TAGAGATTGA 
TGGAGTTTCG CATAa=.TCTA TACTTAAAG^v CGAGATCGCA CTTAAAGAGA 
TTATGAAGCA GATITCAATT ATTAATTATG AATTAGCCA;^. TGTTAATGCC 
GTCAATGA\T QA 



1300
1350
1400
1450
1500
1550
1562
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<212> genomic DNA

<213> Arabidopsis thaliana

<400> 10



ATGGGAGCGA 
TTTTTTCTTG 
TTCACGGCGA 
TCGACGCJSwGC 
CGrvCTTCA,^t 
ATCTTCATTT 
AATTCCAAGC 
TCGTTCATTA 
AAAACAGCTQ 
TAGCGCAATG 
CAAGTTTTTC 
CTTGTGCATT 
TGGTTTACAG 
TAGATCCTTA 
AGTGGTCTTT 
TTTCGGATTT 
TC7GATATAA 
TTTCTACTGT 
GAACCAA/vTG 
C^J^-^TTGGAA 
TCAGGCTAAT 
TCTGGTCGTC 
CCGTGGCGGC 
TCAGATACTT 
TTGAACTGGC 
CCi-A.CTATCC 
TGTTGTGACT 
GAGCTCCTCT 
GTA;i.CGTTTG 

TGAAAGTATT 
TAGATCTTGA. 
GTCAGCGGCC 
TTCAGATTAT 
CTTTrTATTT 
TGACATGCGC 
GGTTGTTGTC 
AAGA^TTGCA 
TGCAAAGAA,^ 
CAAAATATTC 
ACTAGCGGTT 
CCAA-GAATCT 

CCCrTTTGTC 
GCA;^.'rAGAAG 
GAAGTACCTA 
TTGCTGGCTT 
TGTTCATATG 

ATTCTCTAGT 

TcrrrcTGA^. 

TGATGACCCT 
AAAATGTATT 
ATTCTCAATA 



ATTCGAAATC 
ATTTGCGGTG 
CTACrCGAAG 
TACGAGCGTG 
CCGCTCGACC 
CCTT CGCTCC 
Gr'^JVATATAGC 
GTCAACAGTG 
ACTCGGCGAG 
AATGTGTAAT 
AGAGTGCTCA 
GTGATTCTTT 
CTTCTTTCTG 
TA?lTCAAACA 
CAGCCATCAC 
TTCTTTCTTT 
TATGGCTAAG 
CTGGAAAGAG 
CAATTGTCGC 
QAGCGTGACC 
GTCTTTTATC 
TTCCTTTTTG 
CCTTCTATAG 
TCTGGAATGG 
TTGATCAGCA 
TTAAGTTACC 
TACTGGATTG 
TCTTGGTTCT 
GCCTTCCTGT 
TGATATCAJVC 
ACTTTTGTTA 
. AGTGCTAGTT 
TTAGCTAATA 
TATGGTAGAC 
TAATAGGCTA 
TTCTCATGTT 
CAATTCTTTT 
AGGGTGATAA 
GATAAGCGCG 
TGGCTGGCCG 
A.aACTCTGTA 
TCACTCTCAT 
GCTCTAGTCA 
TTTCACAGCC 
ACTATGACCC 
CCTTTCTTTG 
CriGTACGTC 
CTTTGTCTTT 
GATTATCAGT 
ATG;i.\tGCAA 
TTGTTTTGAC 
TTA.TATATAA 
GTTTTXAATC 
TTGCATATAT 
TCACATTATG 



AGTAACGGCT 
GCCGAACTGC 

GTCGATCCTT 
TCGTATGGCT 
TTATTCTGTC 
AATGAAGCAT 
ACGCTTCTGA 
TCTTTCCCAT 
TAGTCTGCGC 
ATAGTAGTTA 
TGGTTGTTGC 
CTGTCAACTG 
GACCATCCCG 
AGAATTGGAT 
TGAGTTTTCT 
TTCATTAATT 
TGGCTTAAGT 
.TGTTCCATAC 
TTTACTTTCA 
XTCTCTTTTT 
CAGGTTGACC 
TATTXGCCCA 
CTGAGGCTAG 
TATCCATGCT 
ATTTTATTTT 
AGCTCGATAC 
CTTGAGGCAJ*. 
TTCTGAGGTG 
Ci^.G GTCTTA.T 
ATTGAACTGC 
ATCAAAGAAC 
CAACCAAACC 
TTTAAGTTGA 
TGATTTGTTT 
TTTTGTTGGC 
GCGTCGTCAT 
CACATTCTGG 
TATACCACTG 
ACAAATATTA 
TATGCAACTG 
ATTTCGTTCC 
ACATGACCAG 
CGTGAACTAG 
AGATAGCAAG 
TGATAAGAAA 
AAATTGTTTT 
CTTACTATAA 
TCTCTCCTTA 
AACGGGGTA.T 
TAATAGCGTC 
CATGCTAACT 
CTCTGACTCC 
GGTGCTCATC 



TCCTTCACCG 
GGTGGAGGAT 



CGTTGACTTT 



GACTGTCCAT 
AGACACCACT 
GGTCGAGTCA 
GTCTCGTCTC 
ATCTGACTTT 
CGCTTTTGGT 
TTTTTATXCA 
GAAJVATGTTA 
TTACTGATCG 
CTGGTTTAAG 
AGTGTAAGTC 
CCAGGTTACA 
TCAATTTGAT 
TGGTCAATTT 
GGTGTGTTGA 
GATl'GGAGAT 
CAAGCTCAAG 
ATGTAAGATA 
TTTGAAACTG 
TTCA^.TGGGT 
AAATTGCACC 
TATTTCGCTG 
TTCTCTAATT 
CTGATTTGTT 
TCAA.ATCTAC 
ACCTCTGACT 
A.\CTCACTGG 
TGTACGCGAT 
AXATTGTGGG 
ACATGTACAC 
CAAGAAACTT 
ATTGAAATCA 
AAGGCTTCAG 
TGxGGCTTAT 
ACGC AT' XTT l 
TGATGAAGAG 
TTAACATTGA 
TAACACTAAC 
TTTGATGTGT 
CATGGAATGT 
CAGATCGGAC 
AGGATGTTAC 
TATTGCTCAT 
GTTTAAATCT 
G;^JfcwACAAGTA 
XATTATGGA?, 
TTTAGTTGAT 
AATTTTGTTT 
ATACTTTTCA 
TTGGOAGAGA 
TAAAGACAGA 
GTTATTATA7 



TCATCGCCGT 
GAGACCGAGT 

ACACTCCGTT 
AAGGTCCGTG 
CTTGTXGATG 
XCXTATTGAT 
AGAGTCAXAX 
XCGCXAAATG 
ACTAGAXCTG 
GGTCATTTTA 
ACGTGATGGA 
XGXATGGTCC 
ACGGCCTGAC 
XAACAGGTAG 
AXCATCTTGX 
XCACGXCCTC 
GTXTGGTAXA 
XGXCACCAAC 
TTAGXCCTTA 
AGCTAAGAGC 
CTXTAAAACX 
AATAATGTCT 
ASJ^J^CATXAT 
TTGGXACCGG 
GGGGGAGTTA 
GTTGATTTAJG 
TCTCTCTGGT 
TCTCTTTACX 
ATXTTCCTTT 
AXGGTATCTG 
TAGTATACCT 
XGATTXAGXT 
TGACXGAAAX 
TGTGACATAX 
GGAACTGCXC 
GCCATXXTCA 
CTGGGGGTGC 
GAATATCAAX 
AAXXCCTTCC 
AA?vAGTTTCA 
AXCCATCAGX 
GGCCTTCCCA 
TCXXTTCAAA 
ACCAGTTAAr. 
CGAXCA.TCAC 
CXAXAXCAAX 
XAATCAGAAA 
TGTCXTTTTC 
TGAXXCXCXC 
TXCXAGCAAA 
GGXTGXATCA 
CCACCTATAA 
GGXAXGATGC 
TCCCCATTTG 



50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
950
1000
1050
1100
1150
1200
1250
1300
1350
1400
1450
1500
1550
1600
1650
1700
1750
1800
1850
1900
1950
2000
2050
2100
2150
2200
2250
2300
2350
2400
2450
2500
2550
2600
2650
2700
2750
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GTTTGC/JVTA 
CCTATTAAGC 
GGTTWrrACT 
CACGCXXATC 
CCGCW.TGGC 
GTGGC^TGTT 
AAGT^vCTTTT 
AAGTGGG?AG 

GTGCTTTTAA 
G?wTTGTGCAA 
CCtXTAJiwCTG 
CnC ITGCX^ 
CTTGCT^wTGT 
TG^wTTATGAA 
GCAAGAATTG 
CTCTTTTTTA 
CTGGTTATGT 
TCrCTGTJVCT 

•TAAXTGTTGA. 
GC?«CCXXGGG 
GGGGAAGAGA 



TCTTTTTGAA 
GTTAA2..GGTA 
TTGCCCCAAG 
J^TTTATGAAJiw 
^CAAGTXAAA 
ATCTCAG-rCG 
XTATCXTTCC, 
AGGTGTTGCX 

;5,JiJ»A.TTTGTT 
TATCTGCAGG 
GSGATGAGAC 
ACTACTGAAG 
TCTCTAGTAC 
ATTGATCTCT 
GCTCGGACCT 
GTTCCTCACC 
GTTGATTTAC 
CCTCAAG^AC 
CAACAGCCAG 
TCATGAGCAT 
TTAAGTACAT 
ACCGCAGTCT 



TTATGATTTA 
CTAAATGTAT 
TGGCAAACCT 
CTGAAGGTTC 
CAGGAAGGCA 
CATAAGCAAA 
^ l-rrTGAGCTT 
' TGAPACATGA 
TTTCTGAATT 
TTAAGAAACC 
TCTGGAACTG 
GGTAAGCTCA 
ACTAAGATAA 
ACTGCAAXAT 
TATAGGTACC 
HAAGTTAACA 
TTATATAGAT 
CTCCAATTTG 
TTGTATTAAT 
AACACGATGG 
CGGTCAGACA 
AACCTTTTAT 
GSGAGCTTGA 



TC-rrcTcccT 

GAAGCTGTCT 
TATCCTGATA 
CCTCGTGTCA 
AAGTCTTCTG 
TTATTAAACA 
AGTGGATGAT 
CACTTGTATC 
TCATATTATT 
GAAAAACTAC 
TGGTTGATGO 
GAAGTTGGTT 
TACTTGCrrC 
TGACTCTCCG 
CTATCATTCA 
TAACAATGGC 
CAAACTTTAA 
TTCTTTCTAA 
CTAAACGAGA 
AAGCGACGTA 
TCATAGCTAA 
GAAGACTCTG 
TAAAAGTGGG 



TGCATCTTAT 
GTCATAGGTT 
ATTGGA TCAT 
AGGTAATTTT 
TATCAGT CTA 
A.CTAAAATrr 
CAGTGGCTTA 
AAAGATAACT 
AGGAGTAGTC 
TTCATATCTT 
GAACGCTGGA 
TTGAAATTAT 
TCGAACACTG 

CTAcrrrrAT 

CTCTCTTGQT 
TCCCCAGGTA 
GTGTACTTTr 
A^TCATATA 

rrcTCATTGa 
CATGTGGAP1.C 

CATGACAAAA 
^AGCATTCC 
TATTAA 



2800 

2850

2900

2950

3000

3050

3100

3150

3200

3250

3300

3350

3400

3450

3500

3550

3600

3650

3700

3750

3800 

3850 

389$ 
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CTGGGGCCAA AAGTGAA(».T AACAAGGACA CCACAGTCAG AGCATGATGT 50

TCAGATGTAC AAGTGCATCT AAATATAGAG CATCAACATG CTGAAGATAT 100

CATTCCCAAT ATGACAAAGT TACCTACAAT GAAGTACATA ACCTATTATG 150

AGGATTCrCA AAGTTTTCCA GGGACAAGAA CAGCAGTTTG GGAGCTTGAT 200

AAAGCAAATC ACAGGAACAT TGTCAGATCT CCAGCTTTGA TGCCGGAGCT 250

GTGGCTTGAG ATGTGGCATG ATATTCATCC TGATAAAAAG TCCAAGTTTG 300

TTACAAAAGG TGGTGTCTGA TCCTCACTAT TTTCTTCTAT AAATGTTTGA 350

GTTTGTATTG ACATTGTAAG TATTGCAACA AAAAGCAAAG CGTGGGCCTC 400

TCAGGGATCA GGACTGCTAT TGCGATTACG GGAAAGCTCG ATGTGCATGG 450

GCTGAACATT GTGAATACAG GTTACAATAT TCA.*-ATTATA TTTTGCAAAA 500

TATTCTCTTT TTGTGTATTT AGGGCACCTT TCCCCCGTCA CAACGATGCA 550

G7v?ATGTATT CGGGGATGTT CACCTGGGAC AGAGTTGCAG ATTGAAGAGT 600

7CTACATCTC ACATCCTGTC ACAGTATGTG TGATATTTA?. GAAACTTTGT 650

TTGGCGGAAC AACAAGTTTG CACAAACATT TGAAGAAGAA AGCGAAATGA 700

TTCAGAGAG 709






(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau

(43) International Publication Date 
12 October 2000 (12.10.2000) 




PCT 



(10) International Publication Number 

WO 00/60095



(51) International Patent Classification: C12N 15/54,
9/10, 15/81, 15/82, 1/16, 5/10, A01K 67/027, C12P 7/64

(21) International Application Number: PCT/EP00/02701

(22) International Filing Date: 28 March 2000 (28.03.2000) 



(25) Filing Language: English

(26) Publication Language: English

(30) Priority Data:
99106656.4    1 April 1999 (01.04.1999)      EP
99111321.8    10 June 1999 (10.06.1999)      EP
60/180,687    7 February 2000 (07.02.2000)   US



(71) Applicant (for all designated States except US): BASF
PLANT SCIENCE GMBH [DE/DE]; D-67056 Ludwigshafen (DE).

(72) Inventors; and

(75) Inventors/Applicants (for US only): DAHLQVIST,
Anders [SE/SE]; Hemmansvagen 2, S-244 66 Furulund
(SE). STAHL, Ulf [SE/SE]; Liljegatan 7b, S-753 24
Uppsala (SE). LENMAN, Marit [SE/SE]; Revingegatan
13a, S-223 59 Lund (SE). BANAS, Antoni [SE/PL];
Wiolinowa 14, PL-08110 Siedlce (PL). RONNE, Hans
[SE/SE]; Dirigentvagen 169, S-756 54 Uppsala (SE).
STYMNE, Sten [SE/SE]; Torrlosa 1380, S-269 90 Svalov
(SE).

(74) Agent: FITZNER, Uwe; Lintorfer Str. 10, D-40878
Ratingen (DE).



(81) Designated States (national): AE, AL, AM, AT, AU, AZ,
BA, BB, BG, BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK,
DM, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL,
IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU,
LV, MA, MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT,
RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA,
UG, US, UZ, VN, YU, ZA, ZW.

(84) Designated States (regional): ARIPO patent (GH, GM,
KE, LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent
(AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent
(AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU,
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM,
GA, GN, GW, ML, MR, NE, SN, TD, TG).

Published: 

— With international search report. 




(54) Title: ENZYMES OF THE BIOSYNTHETIC PATHWAY FOR THE PRODUCTION OF TRIACYLGLYCEROL AND
RECOMBINANT DNA MOLECULES ENCODING THESE ENZYMES



[Figure: panels for R. communis, sunflower and C. palaestina; legend: TAG, DAG, FA]

(57) Abstract: The present invention relates to the isolation, identification and characterization of nucleotide sequences encoding
an enzyme catalysing the transfer of fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway for the production
of triacylglycerol, to the said enzymes and a process for the production of triacylglycerols.
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